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Local Distribution and the Symmetry Gap: 
Approximability of Multiway Partitioning Problems* 

Alina Ene' Jan VondrakJ Yi WiA 


Abstract 

We study the approximability of multiway partitioning problems, examples of which include 
Multiway Cut, Node-weighted Multiway Cut, and Hypergraph Multiway Cut. We investigate these 
problems from the point of view of two possible generalizations: as Min-CSPs, and as Submodular 
Multiway Partition problems. These two generalizations lead to two natural relaxations, the Basic 
LP , and the Lovasz relaxation. The Basic LP is generally stronger than the Lovasz relaxation, but 
grows exponentially with the arity of the Min-CSP. The relaxations coincide in some cases such as 
Multiway Cut where they are both equivalent to the CKR relaxation. 

We show that the Lovasz relaxation gives a (2 — 2//c)-approximation for Submodular Multiway 
Partition with k terminals, improving a recent 2-approximation [4]. We prove that this factor is 
optimal in two senses: (1) A (2 — 2 /k — e)-approximation for Submodular Multiway Partition with 
k terminals would require exponentially many value queries. (2) For Hypergraph Multiway Cut 
and Node-weighted Multiway Cut with k terminals, both special cases of Submodular Multiway 
Partition, we prove that a (2 — 2 /k — e)-approximation is NP-hard, assuming the Unique Games 
Conjecture. 

Both our hardness results are more general: (1) We show that the notion of symmetry gap , 
previously used for submodular maximization problems [Ml IS], also implies hardness results for 
submodular minimization problems. (2) Assuming the Unique Games Conjecture, we show that 
the Basic LP gives an optimal approximation for every Min-CSP that includes the Not-Equal 
predicate. 

Finally, we connect the two hardness techniques by proving that the integrality gap of the 
Basic LP coincides with the symmetry gap of the multilinear relaxation (for a related instance). 
This shows that the appearance of the same hardness threshold for a Min-CSP and the related 
submodular minimization problem is not a coincidence. 


1 Introduction 

In this paper, we study the approximability of multiway cut/partitioning problems, where a ground 
set V should be partitioned into k parts while minimizing a certain objective function. Classical 
examples of such problems are Multiway Cut (that we abbreviate by Graph-MC), Node-weighted 
Multiway Cut (Node-Wt-MC) and Hypergraph Multiway Cut (Hypergraph-MC). These problems 
are NP-hard but admit constant-factor approximations. 

*This is a full version of the paper that appeared in ACM-SIAM SODA 2013 [9]. 
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Multiway Cut (Graph-MC): Given a graph G = (V,E) with weights on the edges and k terminals 
t\, t 2 , ■ ■ ■ , tfc G V, remove a minimum-weight set of edges so that every two terminals are disconnected. 

Node-weighted Multiway Cut (Node-Wt-MC): Given a graph G = (V,E) with weights on the 
nodes and k terminals ti,t 2 , ■ ■ ■ ,tk G V, remove a minimum-weight set of vertices so that every two 
terminals are disconnected. 

Hypergraph Multiway Cut (Hypergraph-MC): Given a hypergraph H = (V, E) with weights on 
the hyperedges and k terminals t\,t 2 , ■ ■ ■ , ifc G V, remove a minimum-weight set of hyperedges so that 
every two terminals are disconnected. 

Although the problems above are formulated as vertex/edge/hyperedge removal problems, Graph- 
MC and Hypergraph-MC can be also viewed as partitioning problems where vertices are assigned 
to terminals and we pay for each edge/hyperedge that is cut between different terminals. The Node- 
Wt-MC problem can also be stated in this form, and in fact shown to be approximation-equivalent to 
Hypergraph-MC (although the reduction is more complicated, see [193). Given this point of view, 
and the fact that the cut function in graphs/hypergraphs is submodular, the following generalization 
of these problems was proposed in [26] and recently studied in urn- 

Submodular Multiway Partition (Sub-MP): Given a submodular set function f : 2' - —>• M + and 
k terminals t\, t 2 , ■ ■ ■, tk G V, find a partition of V into A\,... , such that ti G A, and Yli =1 /(Tl?) 
is minimized. 

This problem captures the problems Graph-MC, Node-Wt-MC and Hypergraph-MC as spe¬ 
cial cases0 One of the useful aspects of viewing the problems in this more general framework is 
that non-trivial linear programs, discovered on a case-by-case basis in the past, can be viewed in a 
unified way: by considering the Lovasz extension of a submodular function (see Section [2] for details). 
In particular, [5] rederives the geometric CKR relaxation for Multiway Cut [2j in this way. It was 
shown in [3] that Sub-MP is not significantly more difficult than its special cases: Sub-MP admits a 
2-approxinration in general, and an improved (3/2 — l//c)-approximation when the number of termi¬ 
nals is k and / is a symmetric submodular function (in the sense that f(S) = /(S); we denote this 
special case as Sub-MP-Sym). We remark that Graph-MC is a special case of Sub-MP-Sym, while 
Node-Wt-MC and Hypergraph-MC are not. 

This compares to the approximability of the classical problems as follows: Graph-MC admits a 
1.2965-approximation, which has been obtained by a sequence of successive improvements 0 da eei], 
all based on the CKR relaxation of [2]. The (3/2 — l//c)-approximation obtained by [2] matches 
the result of [3j for Sub-MP-Sym. On the hardness side, it was proved that the CKR relaxation 
provides the optimal approximation factor for Graph-MC, assuming the Unique Games Conjecture 
m- However, the actual approximation factor is not known: it is only known that it is between 8/7 
and 1.2965. In the case of 3 terminals, the optimal factor is known to be 12/11 [121 [17] . 

The problems Node-Wt-MC and Hypergraph-MC are known to be approximation-equivalent, 
and both admit a (2 — 2//^-approximation for k terminals [191 10]. It is known that a (2 — e)- 
approximation independent of k would imply a (2 — e)-approximation for Vertex Cover, which would 
refute the Unique Games Conjecture. Therefore, we do not expect an approximation better than 
2 for Node-Wt-MC and Hypergraph-MC when the number of terminals is large. Nevertheless, 
this reduction does not give any hardness for a constant number of terminals k, and the optimal 
approximation for a given k was not known. 

x We point out that in the case of Hypergraph-MC, the reduction is not as direct as one might expect - this is 
due to the fact that we want to count each cut hyperedge only once, independently of how many terminals share it. 
Consequently, the arising submodular function is not symmetric, although the cut function in a hypergraph is. 
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Our contribution. We study these partitioning problems from two points of view: (a) as general 
partitioning problems with a submodular cost function, using a natural convex program for the problem 
based on the Lovasz extension of a submodular function; we refer to this convex program as the Lovasz 
relaxation ; (b) regarding them as Min-CSPs (constraint satisfaction problems), which leads to another 
natural relaxation that has been referred to as the Basic LP (see, e.g., mii22])@ Our concrete results 
are as follows. 

Concrete results: 

• We give a (2 — 2/£:)-approximation for the Sub-MP problem with k terminals using the Lovasz 
relaxation. We also show that this is optimal, in two different senses. 

• We prove that any (2 — 2 /k — ^-approximation for Sub-MP (for a constant number of terminals 
k) requires exponentially many value queries in the oracle model. 

• We prove that for Node-Wt-MC, a special case of Sub-MP, it is Unique-Games-hard to achieve 
a (2 — 2 /k — e)-approximation (for a constant number of terminals k). 

Since Hypergraph-MC is approximation-equivalent to Node-Wt-MC, we determine the approx- 
imability of all three problems, Sub-MP, Hypergraph-MC and Node-Wt-MC, to be exactly 2—2 /k 
(assuming the Unique Games Conjecture in the case of Hypergraph-MC and Node-Wt-MC). 

Unique Games-hardness vs. NP-hardness more generally: Our hardness proofs in fact lead 
to more general results, revealing an interesting relationship between the Unique GAMES-hardness of 
Min-CSP problems and NP-hardness of their natural submodular generalizations. 

• We show a Unique GAMES-based hardness result for Min-CSP problems, generalizing the ma¬ 
chinery of m- Roughly speaking, we show that for every Min-CSP problem that includes 
the Not-Equal predicate, the integrality gap of the Basic LP can be translated to a Unique 
GAMES-hardness result. 

• We show how the symmetry gap technique, previously developed for submodular maximization 
problems [23], applies to submodular minimization problems. This technique yields hardness re¬ 
sults in the value oracle model, or computational hardness assuming NP ^ RP, using the tech¬ 
nique of [8]. In particular, we prove that it is hard to achieve a better than 1.268-approximation 
for Sub-MP-Sym (a special case of Sub-MP where the cost function is symmetric submodular). 

Finally, we present a connection between the two approaches, proving that the integrality gap of the 
Basic LP coincides with the symmetry gap of the multilinear relaxation (see the discussion below and 
Section [6] for more details). 

Discussion. Let us comment on some connections that we observed here. 

Integrality gap vs. symmetry gap. While Unique GAMES-hardness results typically start from an 
integrality gap instance, hardness results for submodular functions often start from the multilinear 
relaxation of a problem, exhibiting a certain symmetry gap (see mm- This is a somewhat different 
concept, where instead of integral vs. fractional solutions, we compare symmetric vs. asymmetric 
solutions. In this paper, we clarify the relationship between the two: For any integrality gap Min-CSP 
instance of the Basic LP, there is a related Min-CSP instance that exhibits the same symmetry gap 
in its multilinear relaxation. Conversely, for any symmetry gap instance of the multilinear relaxation 
of a Min-CSP instance, there is a related Min-CSP instance whose Basic LP has the same integrality 
gap (see Section [6]). Therefore, the two concepts are in some sense equivalent (at least for Min-CSP 

2 In the ACM-SIAM SODA 2013 version of this paper, we referred to this LP as Local Distribution LP. 
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problems). This explains why the Unique GAMES-hardness threshold for Hypergraph-MC and the 
NP-hardness threshold for its submodular generalization Sub-MP are the same. 

Lovasz vs. multilinear relaxation. The fact that the symmetry gap technique gives optimal results for 
a submodular minimization problem is interesting: The symmetry gap technique is intimately tied to 
the notion of a multilinear extension of a submodular function, which has recently found numerous 
applications in maximization of submodular functions [23l fH . 151 FT5l 6j. Nevertheless, it has been 
common wisdom that the Lovasz extension is the appropriate extension for submodular minimization 
PS OH Eli]. Here, we obtain a positive result using the Lovasz extension, and a matching hardness 
result using the multilinear extension. 

Erratum. In the conference version [9], it was claimed that for every fixed k > 1 and e > 0 there is 
an efficient algorithm to round the Basic LP for a Min k -CSP instance, achieving an approximation 
factor within e of the integrality gap. This result was found to contain errors and has been removed. 

Organization. The rest of the paper is organized as follows. In Section [2J we discuss the Lovasz 
relaxation, and show how it yields a (2 — 2/fc)-approximation for the Sub-MP problem. In Section [3j 
we present the symmetry gap technique for submodular minimization problems, and show how it 
implies our hardness results in the value oracle model. In Section 01 we present our hardness result for 
Min-CSP, and show how it implies the hardness result for Hypergraph-MC. In section El we discuss 
the relationship of the Lovasz relaxation and the Basic LP. In Section S we discuss the relationship 
of integrality gaps and symmetry gaps. 

2 Approximation for Submodular Multiway Partition 

In this section, we revisit the convex relaxation proposed by Chekuri and Ene |[5], and provide an 
improved analysis that gives the following result. 

Theorem 2.1. There is a polynomial-time (2 — 2 /k)-approximation for the Sub-MP problem with k 
terminals, where k and the terminals t\,...,tk are given on the input and the cost function is given 
by a value oracle. 

The Lovasz relaxation. The following is the convex relaxation that has been used by Chekuri and 
Ene: 


(SubMP-Rel) 
Vj € U; 
Vi € [fc]; 
Vi, j; 


min E*=i /( x i) : 
Xu=i x i,j = 
xi,n = i; 
x i,j — 9* 


Here, /(x,;) denotes the Lovasz extension of a submodular function. The function / can be defined 
in several equivalent ways (see urn- One definition is based on the following rounding strategy. We 
choose a uniformly random 6 € [0,1] and define Ai(9) = {j : Xij > 9}. Then /(xj) = E[/(A(0))]. 
Equivalently (for submodular functions), / is the convex closure of / on [0, l]' 7 . The second defini¬ 
tion shows that the relaxation SubMP-Rel is a convex program and therefore it can be solved in 
polynomial time. 

Given a fractional solution, we use the following randomized rounding technique, a slight modifi¬ 
cation of one proposed by Chekuri and Ene: 

Randomized rounding for the Lovasz relaxation. 
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• Choose 9 £ ( 2 , 1 ] uniformly at random and define Ai{9) = {j : Xij > 9}. 

• Define U(9) = V\ UjLi Ai(9) = {j : maxj < 9}. 

• Allocate each Aj{9) to terminal i, and in addition allocate U(9) to a terminal i! chosen uniformly 
at random. 

Each terminal t, is allocated to itself with probability one. Moreover, the sets Aj(9) are disjoint 
by construction, and therefore the rounding constructs a feasible solution. The only difference from 
Chekuri and Ene’s rounding [3] is that we assign the “unallocated set” U{9) to a random terminal 
rather than a fixed terminal. (However, taking advantage of this in the analysis is not straightforward.) 
We prove the following. 

Theorem 2.2. The above rounding gives a feasible solution of expected value at most (2—|) Yli=i /( x «)- 

This implies Theorem 12.11 In the following, we prove Theorem 12.21 We assume that /(0) = 0. 
This is without loss of generality, as the value of the empty set can be decreased without violating 
submodularity and this does not affect the problem (since terminals are always assigned to themselves). 
We start by defining several sets, parameterized by 9, that will be important in the analysis. 


• Ai{9) = {j : x^ > 9} 

• A{9) = \Ji =1 Ai(9) = {j : maxiXij > 9} 

• U(9) = V \ A(9) = {j : max* x^ < 9}. 

• B(9) = U(1 — 9) = {j : 1 — riiaxj x tJ > 9}. 

We can express the LP cost and the cost of the rounded solution in terms of these sets as follows. 
The following lemma follows immediately from the definition of the Lovasz extension. 

Lemma 2.3. The cost of the LP solution is 


k x 

LP = E/ f(M0))d9. 
i =1 J o 


The next lemma gives an expression for the expected value achieved by the algorithm in a form 
convenient for the analysis. 

Lemma 2.4. The expected cost of the rounded solution is 


/ 2 \ r 1 0 k r 1 / 2 

ALG=l 2 -t)J 2 f(Ai(9))d9 + - £ / f(A(9) U B{9))d9. 

k / i=\ J 1 / 2 K i=i A 

Proof. The set allocated to terminal i is Ai{9) with probability 1 — 1 /k, and A^(9) U U(9) with 
probability 1/k. We are choosing 9 uniformly between ^ and 1. This gives the expression 


ALG = 2- 


k pi ^ ^ /*1 


f{Ai{9) U U{9))d9. 


We claim that for 9 € [|, 1], A{(9) U U{9) can be written equivalently as Ai( 1 — 9) UB(1 — 9). We 
consider three cases for each element j: 
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• If Xij > |, then j G Aj(0) U 3/(0) for every 9 G [ 5 ,1], because x^y < 5 for every other i' / i and 
hence j cannot be allocated to any other terminal. Similarly, j G Aj(l — 0) U 33(1 — 0) for every 
9 G [^, 1], because 1 — 9 < \ and so j G Aj(l — 9). 

• If < \ and Xij = maxj/ x t >j , then again j G A,(0) U 3/(0) for every 0 G 1], because j is 
always in the unallocated set U(8). Also, j G A,( 1 — 0) U 13(1 — 9), because 13(1 — 9) = 3/(0). 

• If < 5 and x^ < maxj/ Xj/j, then j G A*(0) U U(9) if and only if j G 3/(0) = 33(1 — 0). Also, 
we have x t j = 1 — x i'j < 1 — max*/ xyj , and therefore j G Aj(l — 0) U 33(1 — 9) if and only 
if j G 33(1 — 0). 

To summarize, for every 0 G [|,1], j G A{(8) U U(9) if and only if j G Aj( 1 — 0) U 33(1 — 0). 

Therefore, the total expected cost can be written as 


( 2 -f)E / 1 /w^ + ^E f /(A(0)uc/(0))d0 

2 -?)e r /(^W)d<?+|E r /(A(i-0)u33(i-0))d0 

/ j =1 31/2 K i=1 J 1/2 

o\ ^ /*1 O ^ /»l/2 

2 - E J l/2 i E / o iW 0 ) u 


fc „i 


ALG = 


□ 


In the rest of the analysis, we prove several inequalities that relate the LP cost to the ALG cost. 
Note that the integrals fy 2 f(Ai(9))d8 appear in both LP and ALG. The non-trivial part is how to 

relate fj /2 f(A i (8'))d8 to / Q 1/2 /(A*(0) U B(9))dd. 

The following statement was proved in [J]; we give a simplified new proof in the process of our 
analysis. 

Lemma 2.5 (Theorem 1.5 in [3]). Let f > 0 be submodular, /(0) = 0, and x a feasible solution to 
SubMP-Rel. For 8 G [0,1] let Ai(9) = {x | x v>i > 9}, A{9) = ufL,A,(0) and 3/(0) = V \ A(8). For 
any 6 G [|, 1] the following holds: 

E f HM0))d8> f f{A{8))dd+ C f{U(8))dd. 

■ =1 J 0 Jo Jo 

In the following, we assume the conditions of Lemma 12.51 without repeating them. First, we prove 
the following inequality. 

Lemma 2.6. For any 6 G [4,1], 

E I /((^i( 0 ) u • • • Ai{9)) n A i+1 {8))dd > f f{U(8))d9. 

*=1 Jo Jo 

Proof. First consider 5 = 1. We can view the value fj f(Ai(8)U- ■ ■UAi(9))nAi + i(8))d9 as the Lovasz 
extension evaluated on the vector y* = (xi V-■ ■ Vx,) Ax,+i. Note that v G (Al( 0)U-• -U A(0))nAj + i(0) 
if and only if y v j > 9. Therefore 

f 1 /((A,(0) U • • • U Ai(9)) n A i+1 (9))d6 = /( yi). 

Jo 
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We can also view f(U(9)) as follows: Let u = Yli=i y'* = 1 — (xi V • • • V x*,). (This holds because 
J2i =1 Yi + ( x i V • • • V Xfc) = Eti 1 (( X 1 V ■ ■ ■ V Xj) A Xi+i) + (xi V ■ ■ ■ V x fc ) = X^ =1 x o which can be 
proved by repeated use of the rule (u A v) + [u V v) = u + v, and finally Yli=i x * = !•) Therefore 


k- l z 


k—1 / -j k—1 

E /(yo ^ E ■ 


2=1 


(/ is convex) 


2=1 


= / 


= E— /(u) 


fc- 1 


— i ■ 


where we also used the fact that /(ccx) = a/(x) for any a E [0,1] (/(x) is linear under multiplication 
by a scalar). Equivalently, 

E f 1 mMO)u---M8))nA t+ 1 (O))d0> Cf{u(0))de. 

i=i Jo Jo 

Now note that, if 6 > 5 > 1/2, the sets (Ai(0)U- • Ai(0))C\Ai + \(6) are empty, since x * = 1 

and hence two vectors Xj, x,; + ] cannot have the same coordinate larger than We also assumed that 
/(0) = 0, so we proved in fact 


as desired. 


E ! f({A 1 (d)u---M&))riA i+1 (0))de> 

i =i - 7o 



f(U{0))de 


□ 


Given this inequality, Lemma 12.51 follows easily: 


Lemma 001 By applying submodularity inductively to the sets Ai(0) U • • • U A t (9) and Ai + \{6), we 
get 


k k—1 

/((Ai(0) u • • • u Ai(0)) n A i+1 (6)) + fiA^e) U • • • U A fc (0)) 

2=1 2=1 

k -1 

= E /((^i W u • • • U M6)) n A i+1 (0)) + f(A(9)). 

2=1 

Integrating from 0 to 6 and using Lemma 12.61 we obtain 

j^» ^ j^*_1 c f 

E/ HM 0 ))dO > E [ f((A 1 (6)U---UM9))nA i +i(O))d0+ f f(A(6))dO 

i =i 70 i=i 70 70 

1 /*5 

/(i7(0))d0 + / 4(0)d0. 

Jo 



□ 


A corollary of Lemma 12.51 is the following inequality. 

Lemma 2.7. 

/- 1/2 rl/2 

E / f(Ai(O))d0 > / f(B(0))dd. 

i =i - y o 7 ° 
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Proof. Considering Lemma 12.51 we simply note that U(9) = B{ \ — 9). We discard the contribution of 
f(A(9)) and keep only one half of the integral involving B{1 — 6). □ 


We combine this bound with the following lemma. 

Lemma 2.8. 

k - 1/2 k rl/2 


k rl/2 k rl/2 r 1/2 

V / f(M9))de > V / KMO) U B(e))<w - (k - 2 ) / f(B(9))de. 

t=i Jo i =1 Jo Jo 

Proof. For simplicity of notation, we drop the explicit dependence on 6, keeping in mind that all the 
sets depend on 9. By submodularity, we have /(Aj) + f(B) > /(Aj UB) + /(Aj n B ). Therefore, 

k k 

> ^2(f(A i UB) + f(A i nB)-f(B)) 


i— 1 


i =1 
k 


Y^f(AiUB) + J2f(AnB)-k-f(B) 


i=l 


Z— 1 


This would already prove the lemma with k instead of — 2; however, we use JT =1 /(Aj n B) to 
save the additional terms. We apply a sequence of inequalities using submodularity, starting with 
f(A 1 n B) + /(A 2 n B) > f(A\ nA 2 nB) + /((^ U A 2 ) n B), then /((A x U A 2 ) n B) + /(As nB)> 
f{{A\ U A 2 ) n A 3 n B) + /((Ai U A 2 U A 3 ) n B), etc. until we obtain 


fc-i 


/(Aj n B) > /((Ai u... u Aj) n Aj +1 n5) + /((Ai u... u A k ) n B). 


i=l 


2=1 


The last term is equal to f(AnB). Moreover, we observe that for every element j . at most one variable 
Xjj can be larger than 1 — max,/ x^j (because otherwise the two variables would sum up to more than 
1). Therefore for every i, (Ai U ... U Aj) n Aj+i C B. So we get 

k k— 1 

£ /(Aj n B)>Y^ /((Ai u... u Aj) n Aj +1 ) + /(A n B). 

i=l i=l 

Integrating from 0 to 1/2, we get 

k r 1/2 rl/2 rl/2 

V / /(Aj n B)d9 > V / /((Ai u... u Aj) n Aj +1 )d<? + / /(An5)d0. 

i=i Jo i=i Jo Jo 


By Lemma 12.61 (recalling that Aj = Aj(0)), we obtain 

k rl/2 


« /- 1/2 />! /- 1/2 
V/ f(AnB)d9> f(U)d9+ f(AnB)dd. 
i=1 Jo Jo Jo 

Using B(9) = 17(1 — 0), submodularity, and the fact that U is the complement of A, we obtain 

k [■ 1/2 /- 1/2 /- 1/2 /* 1/2 

V/ /(AjnB)d0 > / f(B)d9+ / f(U)d9+ f(AnB)d6 

i=1 Jo Jo Jo Jo 


> 


f l/2 rl/2 

/ /(B)d0 + / /(t/u (An£))dtf 

7o 7o 

ri/2 rl/2 

/ /(B)d0 + / /(E/U5)dl9 

7o 7o 
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Finally, for 9 £ [0, ^], we claim that U U B = B. This is because if max,; x t j > then j ^ U, and 
hence the membership on both sides depends only on j £ B. If rriax^ x t j < then j £ B and hence 
also j £ U U B. We conclude that 


and 



r 1/2 

/(^ n S)d0 > 2 / /(R)d# 

Jo 



k j-l/2 

E / (/(AUB) + /(An5) - /(5))d0 

i=i - 7o 

/- 1/2 /* 1/2 
V / /(^ U R)d0 - (k - 2) / f{B)de 

i =1 7 ° 7 ° 


which finishes the proof. 


□ 


A combination of Lemma 12.71 and Lemma 12.81 relates Jq^ f(Ai(0))d9 to ^i=i Jo^ 2 /(7f (#) U 
B(9))d9, and finishes the analysis. 

Proof of Theorem \2.2l Add up |rf x Lemma 12.71 + ^-j-x Lemma [2781 



f{Ai{0))d0 > 



/(^(0) U B(9))de. 


Adding Yli=\ f \/2 f(Ai(O))d0 to both sides gives us that 


k -i k -i 

E / /(^(w > E / 

i=1 7o 7i/: 


i=i yi /2 


/(Ai( 0 ))d 0 + 


fc - 1 


k rl/2 

E / /(A(0)UR(0))d0. 

i=l 70 


The left-hand side is equal to LP, while the right-hand side is equal to 2 - 2 /fc ( see Lemma f2~Tl) . □ 


3 Hardness from the Symmetry Gap 

Here we show how the symmetry gap technique of [25] applies to submodular minimization problems. 
We remark that while the technique was presented in [25] for submodular maximization problems, it 
applies to submodular minimization problems practically without any change. Rather than repeating 
the entire construction of [25], we summarize the main components of the proof and point out the 
important differences. Finally, we mention that the recent techniques of [8] turn a query-complexity 
hardness result into a computational hardness result. First, we show the result for general Submodular 
Multiway Partition, which is technically simpler. 


3.1 Hardness of Sub-MP 

Here we show that the (2 — 2/fc)-approximation is optimal for Submodular Multiway Partition in the 
value oracle model. More precisely, we prove the following. 

Theorem 3.1. For any fixed k > 2 and e > 0, a (2 — 2/k — e)-approximation for the Submodular 
Multiway Partition problem with k terminals in the value oracle model requires exponentially many 
value queries. 
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We note that this result can also be converted into a computational hardness result for explicit 
instances, using the techniques of | 8 ]. We defer the details to Appendix IA.2I and focus here on the 
value oracle model. 

A starting point of the hardness construction is a particular instance of the problem which exhibits a 
certain symmetry gap, a gap between symmetric and asymmetric solutions of the multilinear relaxation. 
We propose the following instance (which is somewhat related to the gadget used in |7j to prove the 
APX-hardness of Multiway Cut). The instance is in fact an instance of Hypergraph Multiway Cut 
(which is a special case of Submodular Multiway Partition). As in other cases, we should keep in 
mind that this does not mean that we prove a hardness result for Hypergraph Multiway Cut, since 
the instance gets modified in the process. 

The symmetric instance. Let the vertex set be V = [k] x [k] and let the terminals be t ? ; = (i,i)- 
We consider 2k hyperedges: the “rows” Rj = {(i,j) : 1 < j < k} and the “columns” Cj = {( i,j ) : 
1 < i < k}. The submodular function / : 2 V —» M + is the following function: for each set S, 
f(S) = i 0(1*5' C Ri\) + Xq=i 0(1 S n Cj I), where 4>(t) = t/k if t < k and </>{t) = 0 if t = k. 

Since (j) is a concave function, it follows easily that / is submodular. Further, if a hyperedge is 
assigned completely to one terminal, it does not contribute to the objective function, while if it is 
partitioned among different terminals, it contributes t/k to each terminal containing t of its k vertices, 
and hence 1 altogether. Therefore, f($i) captures exactly the number of hyperedges cut by a 

partition (Si,..., S/-)- 

The multilinear relaxation. Along the lines of [25], we want to compare symmetric and asymmetric 
solutions of the multilinear relaxation of the problem, where we allocate vertices fractionally and 
the objective function / : 2 V —» M + is replaced by its multilinear extension F : [0,1]^ —>• R + ; we 
have T(x) = E[/(x)], where x is the integral vector obtained from x by rounding each coordinate 
independently. The multilinear relaxation of the problem has variables x\ 3 corresponding to allocating 
(i, j) to terminal te. 


k k 

min | F (* e ) ■ Vi,j €[k};Y x ij = 1 . 

i=i e=i 

V* € [fc];*L = 1, 

Vi,j,l € [k] ; x\j > o}. 

In fact, this formulation is equivalent to the discrete problem, since any fractional solution can be 
rounded by assigning each vertex (i,j) independently with probabilities x\,-, and the expected cost of 
this solution is by definition Yle=i -^( x<? )- 

Computing the symmetry gap. What is the symmetry gap of this instance? It is quite easy to 
see that there is a symmetry between the rows and the columns, i.e., we can exchange the role of rows 
and columns and the instance remains the same. Formally, the instance is invariant under a group Q 
of permutations of V, where Q consists of the identity and the transposition of rows and columns. A 
symmetric solution is one invariant under this transposition, i.e., such that the vertices (i,j) and (j, i) 
are allocated in the same manner, or x- = x 0. For a fractional solution x, we define the symmetrized 
solution as x = ^(x + x T ) where x T is the transposed solution (x T )^ = x :^. 

There are two optimal solutions to this problem: one that assigns vertices based on rows, and one 
that assigns vertices based on columns. The first one can be written as follows: x\ rj = 1 iff i = l and 0 
otherwise. (One can recognize this as a “dictator” function, one that copies the first coordinate.) The 
cost of this solution is k, because we cut all the column hyperedges and none of the rows. We must 
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cut at least k hyperedges, because for any i 7 ^ j, we must cut either row R{ or column Cj. Since we 
can partition all hyperedges into pairs like this ({ith,^}, {R 2 ,C^}, {i? 3 ,C 4 }, etc.), at least a half of 
all hyperedges must be cut. Therefore, OPT = k. 

Next, we want to find the optimal symmetric solution. As we observed, there is a symmetry 
between rows and columns and hence we want to consider only solutions satisfying x- = x ^ for 
all i, j. Again, we claim that it is enough to consider integer (symmetric) solutions. This is for the 
following reason: we can assign each pair of vertices (i,j) and (j, i) in a coordinated fashion to the same 
random terminal: we assign (i, j) and (j, i) to the terminal tp with probability x(- = x Since these 
two vertices never participate in the same hyperedge, the expected cost of this correlated randomized 
rounding is equal to the cost of independent randomized rounding, where each vertex is assigned 
independently. Hence the expected cost of the rounded symmetric solution is exactly Yli =1 F(x e ). 

Considering integer symmetric instances yields the following optimal solution: We can assign all 
vertices (except the terminals themselves) to the same terminal, let’s say t\ . This will cut all hyperedges 
except 2 (the row R\ and the column C\). This is in fact the minimum-cost symmetric solution, because 
once we have any monochromatic row (where monochromatic means assigned to the same terminal), 
the respective column is also monochromatic. But this row and column intersect all other rows and 
columns, and hence no other row or column can be monochromatic (recall that the terminals are on 
the diagonal and by definition are assigned to themselves). Hence, a symmetric solution can have at 
most 2 hyperedges that are not cut. Therefore, the symmetric optimum is OPT = 2k — 2 and the 
symmetry gap is 7 = (2k — 2)/k = 2 — 2/k. 

The hardness proof. We appeal now to a technical lemma from |25j, which serves to produce 
blown-up instances from the initial symmetric instance. 

Lemma 3.2. Consider a function f : 2' —» R that is invariant under a group of permutations Q on 
the ground set X. Let F(x) = E[/(x)], x = E cre g[cr(x)], and fix any e > 0. Then there exists 5 > 0 
and functions F, G : [0, l] v —>• R + (which are also symmetric with respect to Q), satisfying: 

1. For all x € [0, l] v , G(x) = E(x). 

2. For all x G [0, l] v , |F(x) — F(x)| < e. 

3. Whenever ||x — x || 2 < 5, F(x) = G(x) and the value depends only on x. 

4- The first partial derivatives of F,G are absolutely continuous H 

5. If f is monotone, then > 0 and > 0 everywhere. 

6. If f is submodular, then / )x . < 0 and q^..q x , < 0 almost everywhere. 

We apply Lemma 13.21 to the function / from the symmetric instance. This will produce continuous 
functions F, G : 2} —>• M + . Next, we use the following lemma from [25] to discretize the continuous 
function F, G and obtain instances of Sub-MP. 

Lemma 3.3. Let F : [0, l] v —>• R be a function with absolutely continuous first partial derivatives. Let 
N = [n], n > 1, and define f : N x V R so that f(S) = F(x) where 37 = D (N x {i})|. Then 

1. If > 0 everywhere for each i, then f is monotone. 

2. If q x .q x . < 0 almost everywhere for all i,j, then f is submodular. 

3 A function F : [0, l] v —► R is absolutely continuous, if Ve > 0; 35 > 0; ||xj—yj|| < 5 => Y(h=i |T( x i)—E(yi)| < t. 
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Using Lemma 13.31 we define blown-up instances on a ground set X = N x V as follows: For each 
i E N, choose independently a random permutation a E Q on V, which is either the identity or the 
transposition of rows and columns. Then for a set S C N x V, we define £(S) E [0, l] v as follows: 


fc(5) 


{ieN:(i,aU(j))eS} 


We define two functions f,g : 2 V —>• M_|_, where 


/(5) = F(£(S)), 


5(5) = G(^(5)). 


By Lemma 13.31 f,g are submodular functions. We consider the following instances of Sub-MP: 


max 


k 

f(Se) : (Si, ..., Sk) is a partition of X & Vi E N; E [A:]; (i, if) E <Sf 

£=1 


r 

max < y^g(S() : (S\, ..., Sk) is a partition of X Sz \/i E IV; W E [A:]; (■ i,tfi) E Sf 
l f=i 

Note that in these instances, multiple vertices are required to be assigned to a certain terminal (n 
vertices for each terminal). However, this can be still viewed as a Submodular Multiway Partition 
problem; if desired, the set of pre-labeled vertices Tf = N x {if} for each terminal can be contracted 
into one vertex. 

Finally, we appeal to the following lemma in [ 23] . 

Lemma 3.4. Let F, G be the two functions provided by Lemma \3.S[ For a parameter n E and N = 
[n], define two discrete functions f,g : 2 NxV —>• M + as follows: Let cr W be an arbitrary permutation 
in Q for each i E N. For every set S C N x V, we define a vector ^(S 1 ) E [0, l] v by 


ys) 


{i€N:(i,aM(j))€S} 


Let us define: f(S) = F(£(S)), g(S) = G(f(S)). Then deciding whether a function given by a value 
oracle is f or g (even using a randomized algorithm with a constant probability of success) requires an 
exponential number of queries. 


Lemma 13.41 implies that distinguishing these pairs of objective functions requires an exponential 
number of queries. We need to make one additional argument, that the knowledge of the terminal 
sets Te = N x {tp} (which is part of the instance) does not help in distinguishing the two objective 
functions. This is because given oracle access to / or g, we are in fact able to identify the sets T(, 
if we just modify the contribution of each row/column pair R.(, by a factor of 1 + q, where q is 
some arbitrary small parameter. This does not change the optimal values significantly, but it allows 
an algorithm to distinguish the sets Ti easily by checking marginal values. Note that then we can also 
determine sets such as = N x {(i,j), ( j,i )}, but we cannot distinguish the two symmetric parts of 
Ti j, which is the point of the symmetry argument. In summary, revealing the sets T) does not give 
any information that the algorithm cannot determine from the value oracles for /, g. and given this 
oracle access, / and g cannot be distinguished. 

It remains to compare the optima of the two optimization problems. The problem with the 
objective function / corresponds to the multilinear relaxation with objective F, and admits the “dic¬ 
tatorship” solution Si = {(i,j) : 1 < j < A} for each i E [ k ], which has a value close to k. On the 
other hand, any solution of the problem with objective function g corresponds to a fractional solution 
of the symmetrized multilinear relaxation of the problem with objective G , which as we argued has a 
value close to 2k — 2. Therefore, achieving a (2 — 2 /k — e)-approximation for any fixed e > 0 requires 
an exponential number of value queries. 
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3.2 Hardness of Symmetric Submodular Multiway Partition 

Here we state a result for the Sub-MP-Sym problem. 

Theorem 3.5. For any fixed k sufficienly large, a better than 1.268-approximation for the Sub-MP- 
Sym problem with k terminals requires exponentially many value queries. 

The proof is essentially identical to the previous section, however the symmetric instance is different 
due to the requirement that the submodular function itself be symmetric (in the sense that f(S) = 
f(S)). The analysis of the symmetry gap in this case is technically more involved than in the previous 
section. The result that we obtain is as follows; we defer the proof to Appendix [Aj 

4 Hardness from Unique Games 

In this section, we formulate our general hardness result for Min-CSP problems, and in particular we 
show how it implies the hardness result for Hypergraph Multiway Cut (Hypergraph-MC). 

4.1 Min-CSP and the Basic LP 

The Min-CSPs we consider consist of a set of variables and a set of predicates (or cost functions) 
with constant arity over the variables. The goal is to assign a value from some finite domain to each 
variable so as to minimize the total cost of an assignment. Alternatively, we can view these variables 
as vertices of a hypergraph and the predicates being evaluated on the hyperedges of the hypergraph. 

Definition 4.1. Let 6 = {’L : [g]* —>• [0,1] U {oo} | i < k} be a collection of functions with each 
function in 13 has at most k input variables in [g] and outputs a value in [0,1]. We call k the arity and 
q the alphabet size of the 6. 

An instance of the Min-6-CSP, specified by (V , E, ^e = {'U | e E E}, we = {rc e | € E}), is defined 
over a weighted k-multi-hypergraph G(V,E). For every hyperedge e = (uq,Ui 2 ,.., v; L] ) E E, there is an 
associated cost function 'he E 13 and a positive weight w e . The goal is find an assignment i : V —» [g] 
for each vertex v E V so as to minimize 

Y w e -^ e (£(vi 1 ),...,i{y ij )). 

e=(v il ,...,v ij )eE 

If there is a subset of vertices (which are called the terminal vertices) such that each has a single 
required label, we call the corresponding problem Min-6-TCSP. If for every vertex v, it is only allowed 
to choose a label from a candidate list L v C [g] (L v is also part of the input), we call the corresponding 
problem Min-6-LCSP. 

Given a Min-B-CSP (or Min-B-TCSP/Min-13-LCSP) instance, it is natural to write down the fol¬ 
lowing linear program. We remark that this LP can be seen as a generalization of the Earthmover LP 
from df and has been referred to as the Basic LP DUES]. The LP captures probability distribu¬ 
tions over the possible assignments to each constraint (which is why we referred to this as the Local 
Distribution LP in the conference version of this paper). 

The Basic LP. There are variables x ej0l for every hyperedge e E E and assignment a E [g]^, and 
variables x v j for every vertex v € V and j E [g]. The objective function is of the following form: 

LP(Z) = min Y We ' ^2 x e,a ’ ’UU) 
e Q'G[q , ]l e l 
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under the constraint that, for every v € V, 


q 

~y'] x v ^i — i 

*=i 

where 0 < x v _i < 1 for every v,i. We also have constraints that for every edge e = (v\,V 2 ,Vj) and 
i € S and go € [g] 

x Vi,q 0 = x e,a- 

Oii=qo 

where 0 < x Vi>a < 1 for every x Vua . 

As for Min-B-LCSP as well as Min-B-TCSP, we would add the following additional constraint: for 
every go that is not a feasible label assignment for v, we would add 

x v,q 0 = 0- 

To see why this is a relaxation, one should think of x V i as the probability of assigning label i to 
vertex v. For every edge e = (v\,V 2 ,Vj) and labeling a = (fa, fa,..., fa) for the vertices of e, x eyQl 
is the probability of labeling the vertices of e according to a (that is, vertex tg receives label l fa. For 
every edge e, we define V e as the distribution that assigns probability x e ,a to each a € [ q ]l e L 

4.2 The Min-CSP hardness theorem 

Definition 4.2. For some i > 2,we define NAE*(xi, X 2 , ■■■xfa : [g] 1 —)• 0,1 to be 0 if x i = X 2 , ...,= Xi 
and 1 otherwise. 

Theorem 4.3. Suppose we have a Min-$-CSP(TCSP/LCSP) instance I(V. E. T e, we) with fractional 
optimum (of the Basic LP) LPiX) = c, integral optimum OPT(Z) = s, and fi contains the predicate 
NAE 2 . Then for any e > 0, for some A > 0 it is Unique Games -hard to distinguish between instances 
of Min-ih-CSP(TCSP/LCSP) where the optimum value is at least (s — e)A, and instances where the 
optimum value is less than (c + e)A. 

As a corollary of Theorem 14.31 we obtain a hardness result for Hypergraph Multiway Cut. This 
follows from a known integrality gap example for Hypergraph Multiway Cut, reformulated for the 
Basic LP. 

Corollary 4.4. The Hypergraph-MC problem with k terminals is Unique Games -hard to approx¬ 
imate within (2 — | — e) for any fixed e > 0. The same hardness result holds even the hyperedge of the 
graph has size at most k. 

Proof. Let fi*, = {NAEj : [k] 1 —>• {0,1} | i = 2 ..., k}. 

First, we claim if we have an a-approximation for the Hypergraph-MC with k terminals for some 
constant a > 1, then we can also have an a-approximation for the Min-fifc-TCSP. To see this, we make 
the following reduction. Take any instance of the Min-B^-TCSP instance, it can almost be viewed 
as a k- way Hypergraph-MC instance on fc-hypergraph as each constraint NAEj(vi, V2 , ■ ■ ■, vf) is 
corresponding to a hyperedge on V\,V 2 ,...,Vi. The only difference that is there may be multiple 
vertices fixed to be the same label in the Min-B^-TCSP instance. To address this, we only need to add 
k new terminals fa, t 2 , ■ ■ ■ ,t]~. For all the existing vertex associated with the label i in the Min-6/.-CSP 
instance, we would add an edge of infinite weight to the corresponding fa. 

Therefore, it remains to show the hardness of approximating Min-fi^-CSP better than 2 — 
Assuming the correctness of Theorem 14.31 consider the following Min-fi^-CSP instance TLk- there are 
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k(k + l)/2 vertices indexed by (i,j) for 1 < i < j < k. We have k hyperedges: for every i E [k], the 
hyperedge e* is dehned as e* = {(* 1 , 12 ) € [A:] 2 \ i = i\ < 12 or i\ < 12 = i}- We define the k terminals 
as ti = (i,i) for every i E [k], with the label of required to be i. 

We claim that OFT(FLk) > k — 1; i.e., there is no assignment with cost 0 on more than one edge. 
Without loss of generality, suppose the optimal solution has cost 0 on edge e\, i.e, assign label 1 to 
every vertex indexed by (1 ,i) for i E [k\. Then we cannot have cost 0 for any of the remaining k — 1 
hyperedges because to satisfy e*, we would need (l,i) to be labeled by i. 

On the other hand, LPfiHk) < k/2. The following is a fractional solution: for every vertex v = 
( i,j ), x Vi i = 1/2 and x v j = 1/2. All the other variables x v ^j are 0 (for k! i,j ). For every edge 
e* with its vertices ordered as (1, i), (2, i),... , (i, i), (i, i + 1 k), we have x e / ii ^ = 1/2 and 
x e,(i, 2 ,...,k) = 1/2. This satisfies all the constraints and achieves an objective value of k/2. 

Therefore, applying Theorem 14.31 we get that it is Unique GAMES-hard to approximate Min-fi*,- 
TCSP beyond the factor = 2 — which implies the same hardness of approximation ratio for the 
k- way Hypergraph-MC problem (even on fc-hypergraph as the arity of Bfc is k). □ 

In the following, we give a proof of Theorem 14.31 which is by an extension of the technique of [Uj. 
In Section 14.31 we first review some standard definitions from the analysis of boolean functions. Then 
we describe our reduction and analyze it in Section 14.41 

4.3 Tools from Discrete Harmonic Analysis 

We now recall some standard definitions from the analysis of boolean functions. We will be considering 
functions of the form / : [q] n —>• M fc , where q,n,k E N. The set of all functions / : [q] n —>• forms a 
vector space with inner product 

(f,g)= E [ {f(x),g(x))}; 

x~[q] n 

here we mean that x is uniformly random and the (•, •) inside the expectation is the usual inner product 
in M fc . We also write ||/|| = \J (/, /) as usual. 

Definition 4.5. For random x,y E [q] n , we say that y is p-correlated with x if given x, we generate y 
by setting yi = Xi with probability p or randomly in [q] with probability 1 — p, independently for each i. 
For 0 < p < 1, we define T p to be the linear operator given by 

Tpf (x) = E[/(y)], 
y 

where y is a random string in [q] n which is p-correlated to x. We define the noise stability of / at p 
to be 

Stab,!/] = (f ,T p f). 

Definition 4.6. For i E [n], we define the influence of i on / : [q] n — >• to be 

Inf*[/] = E Var [/ (x)] , 

Xl,...,Xi_i,Xi + i,...,Xn~[gJ L 

where Var[/] is defined to be E[||/|| 2 ] — || E[/]|| 2 . More generally, for 0 < S < 1 we define the £-noisy- 
influence of i on / to be 

Inf S 1_a) [/] = Inf fiT^sf}. 

The following facts are well known in the literature of discrete harmonic analysis. 


15 





Fact 4.7. 


InfM[/] = ^Inf l (1 - 5) [/ J ] J 

3 =1 

where fj : [q] n — >• R denotes the jth-coordinate output function of f. 

Fact 4.8. Let /W,..., /(*) 6 e a collection of functions [q] n R fc . For ci, C 2 ,...Q £ M (or [g ] 71 —>■ R fc ^, 
we use the notation avg(ci,... , q) to denote their (pointwise) average j Y^j =l c j- ^en 


Inf 


(1-5) 



< avg 
ie[t] 



For randomized functions with discrete output, /' : [g] n —> [k], we can view them as functions 
defined as / : [q] n —» A^ where A*, is the (fc — l)-dimensional standard simplex. The i-th coordinate 
indicates the probability that the function f outputs i. The following fact is also well known. 

Fact 4.9. For any f : [q] n A k , ]T ” =1 lni l~ 6 [f] < 1 / s - 

An important tool we need is the Majority Is Stablest Theorem from [18] . We state here a slightly 
modified version [ 20 ] using a small “noisy-influence” assumption rather than a small “low degree 
influence” assumption. 

Theorem 4.10. (Majority Is Stablest) Suppose f : [q] n —> [0,1] has Inf- 1_< ^[/] < r and E[/] = fi, 
then 

Stabi_ 5 [/] < Tis{h) + err(r,q,5) 

where for any fixed 5 and q, lim T _>.o err(r, q, 5) = 0. Here the quantity Ti-sip*) is defined to be Pr[x, y < 
t} when (x,y) are joint standard Gaussian with covariance 1 — 6 and t is defined by Pr[x < t\ = y. 

We will use the following asymptotic estimate for Ti_< 5 . 


Lemma 4.11. If 5 C < n < 1 — 6 C for some constant 0 < c < 1, 

n-r 1 - S ( f i) = n(s^ +2c ). 


Proof. Suppose that we have Pr x ^jv(o,i) \ x — A = P- Since (1 — <5)-correlated Gaussian variables 
(x,y) can be generated by starting with two independent Gaussian variables (x,z), and setting y = 
(1 - 6)x + V25 - 5 2 z , we can write 


Ti_ 5 (/x) = Pr[x < t, y < t] = Pr[(l — 6)x + \J 25 — 5 2 z <t\x<t]-/j, (1) 


where 

x, z are independent Gaussian. 

For 

t' = 

■ t — 

V26 

-5 2 , 

we have that 



Pr[(l 

— 5)x + \/ 25 — 5 2 z < t \ x < t] 

= 

Pr 

[x > 

t'\ x 

<t\ ' 

Pr[(l - 

- 5)x + y/25 - 5 2 

z < t | t' 

< x < t] 




+ Pr[x 

< t’ 

| x < 

t] • Pr[(l — 5)x + i/25 — 

5 2 z < t 

X < t'} 



< 

Pr 

[x > 

t'\ x 

<*]■ 

Pr [2 < 

t — (1 — 5)t r . 

—+ Pr [x < t \ 
V26 - 5 2 1 1 - ' 

X < t] 



= 

Pr 

[x > 

t'\ x 

<*]■ 

Pr [z < 

dt 

- - + (1 

sj25 - 5 2 

■ <5)] + Pr 

[x < t'\x < t] 



< 

Pr 

[x > 

t'\ X 

<*]■ 

Pr [z < 

V~5t + (1 — <5)] + 

Pr[x < t 

'\x < t] 



= 

Pi ■ 

(1- 

-P2) 

+ (1- 

~Pi) = 

1 - P 1 P 2 
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where pi = Pr[x > t'\ x < t] and P 2 = Pr [2 > \f~5t + (1 — (5)]. Hence we get that 

m - ri_«5(/i) = ppiP2- 

Below we prove 

1 . pi = H(J5+ C ). 

2. P2 = fi(l). 

Combining with the fact that p > 5 C , this will complete the proof of Lemma 14.111 
We need following property of Gaussian Distribution proved in [3]. 

Lemma 4.12. [Sj Let f(x) = ^=e 2 be the density function of Gaussian Distribution. For every 
s > 0, 

< Pr \g > s] = Pr [g < -s] < ^. 

S “hi g^N(0,l) g~N(0,l) S 

Recall that Pr^ 7 v(o,i )\d < i] = A*. In the case of fi < 1/2 (and therefore £ < 0), we have that 

p = Pr ( g<t)< 

0 , 1 ) |t| 

In the case that p > 1/2 (and therefore, t > 0), we also have that 

Pr (5 > t) = 1 - p < 

S~JV(0,1) |t| 


Combining the above two cases, we have that 

S c < min(l — p, p) < 
For sufficiently small 5 , we claim that 


M =_ 

\t\ y/2n\t\ 


1 

e 2 


\t\ < v / 2dog( 1/5), 

This can be proved by contradiction. Suppose that |f| > y / 2clog(l/d), we would have 


/(*) 


< 


< <5 C 


|t| y^c log (1/5) 

when 5 is small enough. Therefore, 

P 2 = Pr[;r > VSt + (1 — (5)] > Pr[z > \j2cd log(l/<5) + 1 ]. 


For sufficiently small 5, we have ^ /2c5 log(1 /<5) < 1 and P 2 > Pr[z > 2] > 0.1. 

Next, we establish a lower bound for p \: 

pi = Pr[t' < x < t\x < t] > Pr[t' < x < t\ > /(max(|i , |, |t|)) • (t — t') 

> f(\t\ + V26 - 6 2 )V2S - 5 2 > VS ■ f(\t\ + V2S-S 2 ). 

Noticing that \/2 S — S 2 < V2S, we have 


=-(\/ 2clo g( 1 / 5 )+ x ^) 2 / 2 g- (c log(l/ S)+5+y/ 46c log(l/<5)) 


> 


m + V^)>m + vi 6 )> ^ 

The last inequality holds for sufficiently small constant 5 and therefore pi > . Overall, we have 

that 

p - Ti_s{p) > PP1P2 = D(52 +2c ) 

and this finishes the proof of Lemma 14.111 □ 
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4.4 Reduction from Unique Games 

We describe a reduction from the Unique Games problem to Min-fi-CSP where fi contains the NAE 2 
predicate. 

Definition 4 . 13 . A Unique Games instance U(V, E, {tt u ,v} ( u ,v)eE 1 R) consists of a regular graph 
G(V,E) and each edge e = (u, v) € E is associated with a permutation n uv : [R] i-a [R\. For a given 
labeling L : V eA [R], an edge e = (u, v) E E is said to be satisfied if L{v) = n UiV (L(u)). We denote 
OPT(Z4) to be the maximum fraction of edges that can be satisfied by all the labeling. 

Conjecture 4.14 (The Unique Games Conjecture). Given a Unique Games instance IA, for every 
e > 0, there exists some large enough integer R , given a Unique Games instance U (V, E, {-k u ,v} ( u ,v)eE> R) > 
it is NP-hard to distinguish between the following two cases: 

• OPT(U) > 1 - e; 

• OPT(U) < e. 

Now we are ready to prove Theorem 14.31 We first prove the hardness result for Min-B-CSP and 
then extend it to Min-fi-LCSP as well as Min-fi-TCSP. The reduction takes a Unique Games instance 
U(V,E , {7r e |e € E},R) and it maps U to a fi-CSP instance At, using an integrality gap instance in 
the process. Suppose the integrality gap instance is 

1' = (V',E',V ei = {T e | e E E?},w E ' = {w e \ € E'}) 

and suppose that \V'\ < rn. OPT(X') = c and LP(X') = s. Let us assume that the arity of B is k and 
the alphabet size of B is q. Without loss of generality, let us assume that the sum of the weights over 
all the hyperedges in E' normalized to 1; thus we can view the weights w e as a probability distribution 
over hyperedges. The reduction produces a new fi-CSP instance M with the following properties, for 
some parameter 0 < rj < 1 to be specified later: 

1. Completeness property: if OPT (U) > 1 — e, then OPT(A4) < rj(c + 0(l/m)); 

2. Soundness property: if OPT (U) < e, then OPT(A4) > rj(s — 0(k/m)). 

By taking sufficiently large m, we have that it is Unique GAMES-hard to get any approximation 
better than | — eo for any eo > 0 for the Min-B-CSP. 

The vertex set of ATs is going to be V x V' x [q] R . ■ The sum of the hyperedge weights in A4 is 
equal to one and we described it as a distribution over all the hyperedges. 

Recall that we denote by V e the probability distribution over assignments in [q] e corresponding 
to the optimal fractional solution of X', restricted to the hyperedge e. Given any 7r : [X?] —>• [R] and 
x € [g] R , we use tt(x) to indicate a vector in [gp such that w(x)i = x n ny 
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Reduction from Unique Games to B-CSP. 

We choose the parameters as follows: rn > max(|V 7 |, q *, k 2 ), e < l/m 80 ,r/ = 1/m 39 ,6 = 

1/m 40 . The weight w e of a hyperedge e with cost function T e is the probability that it is 
generated by the following procedure. 

• (Edge test) With probability r], we pick an edge e = (v[,v' 2 , ■ ■ ■, v'j) from E'. Then we 
randomly pick a vertex v from V and randomly pick j of its neighbors v\,v 2 ,Vj. 

We generate x 1 ,x 2 ... ,x? € [q] R according to V R . Output the cost function T e on 
(vi,v[,ir Vl!V ( x 1 )), (■ v 2 ,v' 2 ,ir V2jV (x 2 )) ..., n VjtV (x 3 )). 

• (Vertex test) With probability (1 — 77 ), we pick a vertex v from V and two of its 
neighbors v \ , v 2 ■ We randomly pick a vertex v' from V'. Then we generate (1 — <5)- 
correlated x,y € [q] R and output a cost function NAE 2 on (vi,v',ir vi>v (x)) and 
(v 2 ,v',Tr V2:V (y)). 

A function f : V x V' x [q] R —>• [q] corresponds to a labeling of the instance M.. Let us use Val(/) 
to denote the expected cost of / and Val e dge(f) and Val„ ertea; (/) to denote the expected cost of / 
on the edge test and the vertex test, respectively. Also let us use the notation f v y : [q] R —>• [q] 
to denote the restriction of / to a fixed pair v G V,v f G Vf v y(x) = f(v,v',x). We know that 
Val(/) = (1 — rf) ■ Val V ertex(f ) + V ' Val e dge{f)- In the following we prove the completeness and 
soundness property of the reduction. 

Lemma 4.15 (completeness). If OPT(W) > 1 — e, then OPT(A4) < rj(c + 0(l/m)) = c/m 39 + 
0 (l/m 40 ). 

Proof. Suppose that a labeling A : V —>• [R] satisfies (1 — extraction of the edges in the Unique Games 
instance. Let us consider a “dictator labeling” f v y (x) = xyy for every (v,v') € V x V'. Since a 
(1 — extraction of the edges in the Unique Games instance can be satisfied, by an averaging argument 
and the regularity of the graph, we know that for at least a (1 — \/e)-fraction of the vertices, we have 
that at least a (1 — \/e)-fraction of its neighbors is satisfied by A. Therefore, by a union bound, when 
choosing a random pair of neighbors of a random vertex in V, with probability at least 1 — 3-yT the 
two edges are satisfied by A. This means that for the vertex test, with probability at least 1 — 3^/e 
we have 7 t 1 , 1 j „(A(ui)) = ir V2tV (A.(v 2 )). Conditioned on this, the cost of the vertex test is at most 5, as 
NAE 2 ( 7 r t , 1 ^(x (A (t, 1 )) = TTv 2 ,v(y(A(v 2 )))) = 0 with probability (1 — <5). Overall, we have that the vertex 
test cost is at most (1 — 3y/e) ■ 6 + 3y/e. 

As for the edge test, by an extension of the argument above, with probability at least 1 — 
(m + 1)\/q we have vr„ liW (A(ui)) = ^^(A^))... = n VjtV (A(vj)). Conditioned on this, since 
(x 4 ( A (i>i ))> ■ ■ ■ > x t- (A(v))) ~ ^ e ’ cost of the edge test corresponds exactly to the cost of the 

fractional solution of the LP: 


E [*e04 


e,vi,V2,---,Vj 


TIv 1 , v (A(V1 ))’*** ’ 7T v . v (A(Vk) 


)] = E e n«( q )] 


= c. 


Ot r ^'Pe 


Therefore, the cost of the edge test is at most (1 — (m + P)\fe) ■ c + (m + ~I)y/e. 
Overall, the cost of the dictator labeling is 

y ■ ((m + l)y/e + (1 - (m + 1)^) ■ c) + (1 — rf) ■ ((1 - 3\/i) • 5 + 3y/e). 

By the choice of parameters, we obtain that Val(/) = c/?n 39 + 0(l/m 4 °). 


□ 
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It remains to prove the following soundness property. 

Lemma 4.16 (soundness). If OPT(U) < e, then OPT(Al) > r/(s — 0(k/m )) = s/m 39 — O(k/m i0 ). 

Proof. We will prove this by contradiction. Assume that there is an assignment / such that Val(/) < 
s/m 39 — 0(l/?n 4 °). The cost of the vertex test is: 


Val vertex ( f ) 


E [NAE2 {fvi,v' iPv\,V (•£))) fv 2,v' (jfv2,v (y)))] 
vi,V2 ,v' ,x,y 


g 



In the above expression, /* , is the indicator function of whether f v y = i. Also u, ~ v means Vi is a 
random neighbor of v and i,j/€ [gp are (1 — d)-correlated. If we define g l v v ,(x) = v ,{'K u ,v{x)\, 

we have that 


Val verteM) = 1 E [gty (x) • g[y (*/))] 

' v' v,x.y ’ ’ 

i =1 L 

g 

i - Y Stah i-s[g l v, v 


= E 

v.v' 


i= 1 


Recall that k < y/m, by the choice of parameter, we have that 

Val(/) = (1 - rj) • Val ver tex{f) + ??Val edge {f) = s/m 39 - 0{k/m 40 ) = 0(l/m 39 ), 

we know then the cost of the vertex test Val ver tex{f) is 0(l/m 39 ). Let g v y = (g/y , g/y ,... ,g q v y)\ 
9v,v' € A 9 as 3®y(a;) = E u ^v[f u y(^u,v( x )} and each € A g for an y u i by the 

dehnition of f/ v , as the indicator function of f u y = i. For every g v y, we classify it into the following 
three categories: 

(1) dictator function: there exists some g 1 , with its 5-noisy-influence influence above r (with r being 
specihed later), 

(2) constant function: there exists some i with E[g® v , ] > 1 — <5 ai , and 

(3) all the other g v y not in category (1) and (2). 

The main idea of the remaining proof is to show that for every v' € V' . in order to bound 
Val vertex(f) by 0(l/m 39 ), we must have g v y in category (2) for at least a 1 — 0(l/m 10 ) fraction of 
v € V. Then we argue that this will incur a big cost on the edge test. We proceed as follows: 

(i) Bound the fraction of g v y in category (3): Suppose for a fixed v' € V' and an a-fraction of 
v € V, g v y is in category (3); i.e., 

max Inf- S [g v ,v'} < r 
iGR 

and 

ma x E W v ,(x)\ < 1 - d 0 ' 1 . 

»e[fc] x£[k\ R 
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Then setting r to make err(r, q, 5) < 1/m 30 in Theorem 14.101 we have that 

Stable[ 5 *, y ] < Ti_ 5 (/r*y) + err(r, q, 5). 
where /x®,y = E x [g l v v ,(x)\. We know that 

XXy = = XI E {fu,v(nu,v{ x )] = L 


2=1 


2=1 


Suppose that /x®*^, has the maximum value among /x 3 v ,,Hy v ,, ■ ■ ■,Hy v ,, we know then 

y i 1 1 

ftv . v ' — — — 4 — ,'(1 ] • 

’ (/ md 0 ui 

We apply Lemma [4.111 to /x®*^,, observing that <5 ai < /x®*^, < 1 — <3 01 . We obtain (i l * v , — 
Ti s(n l *y) = 0(<5°- 7 ). For i / i* we simply use /x® v , - Stable (#®y) > 0. Therefore, 


2=1 


l-^ stabi - 5 b*y] = (^ly “ Stabr-sky]) 

2=1 

> /4y - ri-iC^V) - err (u 9,5) 

> 0 (( 5°- 7 )-0 

1 


'v,v r 

1 


m' 


30 


= n 


m 


28 


Overall, each particular v' E V is picked with probability at least 1/m, and if < 7 „„/ for an ef¬ 
fraction of v E V is in category (3), the vertex test will have cost fl (a • A. • —. In order to 
keep the cost of the vertex test bounded by 0 (l/m 39 ), we must have a = 0(l/m 10 ). 

(ii) Bound the fraction of g v y in category (1): For a fixed v' E V', suppose that for a /3-fraction 
of v E V, g vv i has a coordinate with 5-noisy influence above r. Then consider the following 
labeling for the Unique Games instance: for each v E V, we can just assign randomly a label 
from the following list: 

Ay = {i E R : Inf 3-5 [c^y] > r} U {i E R : Inf^/Z/y] > r/2}. 


Then by Lemma 14.91 


and 


Elnf^[/„y] < V<>, 

2=1 

Q 

2=1 


we know |A„| = 0(4^). 

By FactO for every 1 E A„, E^„[Inf,-~ 6 [/ u y ( 7 ^( 2 ;))]] > Inf \~ 5 [g v ,v'\ > r. By an averag¬ 
ing argument, at least a r/2-fraction of neighbors u ~ v have a coordinate j E R such that 
Inf ]~ S [f U; y] > r/2 and 7r VjU (i ) = j. Therefore, at least a r/2-fraction of edges in the Unique 
Games instance have candidate labels in their lists that satisfy the edge. Hence choosing the 
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label of v independently and uniformly from A„ will satisfy each edge with probability fl(/3r 3 c) 2 ). 
In expectation, we satisfy an n(/3r 3 ci 2 )-fraction of the edges of the Unique Games instance. 
Since we can take e (in the Unique Games Conjecture) to be an arbitrarily small constant. If we 
set e = min(r 3 • 1/m 4 • A 2 ,1/m 80 ), we conclude that /? = 0(l/m 4 ). 


Therefore, we can assume that for every v' € V', g v y for a (1 — 0(l/m 4 ))-fraction of v € V is in 
category (2); i.e., there exists some i € [q] such that E [g l vv ,\ = E> 1 — <5 01 = 1 — l/?n 4 . 
Therefore, if we pick a random v € V. we have that E„ e y [aiJ > 1 - 0(l/m 4 ). 

Notice the regularity of the graph, we have that 


E \9ly] = E [ E [/;, 

VE.V V£V Ur-JV 


,]]= E [/;„,]> 1 - 0(l/m 4 ). 

uev ’ 


By an average argument, for every v' € V' and for at least a (1 — 0(l/m 2 ))-fraction of the vertices 
u € V, we have that E[/i ,] > 1 — 1/m 2 . Since \V'\ < m, by a union bound, we have that for a 
(1 — 0(l/m)) fraction of the v € U, max,; E[/* ^,] > 1 — 1/m for every v' € V'. Let us call these v € V 
“good”. 

Given (v,v') fixed, let us just consider the labeling of (v,v',x) by arg max,; E[/* v ,\ for every x. 
This labeling has a cost at least s as it assigns a label depending only on ( v,v') which can be viewed 
as the cost of an integral labeling for the gap instance X'. 

Given a good v, f (iq, v^, tt V iV1 (a; 1 )), (v, V 2 , ir V2:V (x 2 )),..., (vj, v'j, ir v ^ Vj (x^)) in the reduction has 
the same cost as labeling each (vi,v , i ,n VijV (x)) with arg max* E[/* v ,\ on (1 — 0(j/m )) fraction of the 
hyperedges. Therefore, we have that Val ve rtex(f) > (1 — 0(j/m )) • s > (1 — 0(k/m )) • s. This implies 
that Val(/) > r/Val 1 , e rt ea; (/) > rj( 1 — k/m) ■ s = s/m 39 — 0(/c/m 4 °) which leads to a contradiction. 

It remains to prove the same result holds for Min-fi-LCSP and Min-fi-TCSP. For Min-fi-LCSP. 
Assuming that there is an integrality gap instance X' with LP(X) = c, OPT(X') = s. For any v' € V' 
with the candidate label set L v i we will remove the requirement that its label must be in L' v , instead 
we will add a unary cost function Pl v , (z) : [g] —>• {0, oo} defined as 


PlM 


0 if z € L v / 
oo if z ^ L v i 


on v'. Such a change transforms the Min-fi-LCSP instance X' into an Min-fi U {Pl v , W € U”}-CSP 
instance X". 

Since adding the Pl , constraint on v' is essentially the same as restricting the labeling of v' in 
L' v , we know that OPT(X') = OPT(X") and LP(X') = LP{X"). We will then use the reduction from 
a Unique Games instance U (with the integrality gap instance X”) to a Min-6 U {Pl , W € V”}-CSP 
instance JA, we still have that 


1. Completeness property: if OPT {U) > 1 — e, then OPT(Ad) < r)(c + 0(l/m)); 


2. Soundness property: if OPT (U) < e, then OPT(Ad) > r](s — 0(k/m )). 


Then we can convert Ai into a Min-B-LCSP instance At'. For any vertex u in Ai with Pl , on 
it, we will remove Pl , and set the candidate list of u to be L v r. By doing this, we removing all the 
appearance of Pl Such a conversion also has the property that OPT(Ad) = OPT(A4 / ). Therefore, 
we prove the same Unique Games hardness result holds for the Min-6-LCSP. 

As for Min-fi-TCSP, for any terminal vertex v' € V with fixed label i, we will remove the the 
requirement of the label of v’ to be i, instead we apply an equivalent cost function 


Pi(z) 


0 if z = i 
oo if z/t 


A similar argument as the proof for Min-fi-LCSP also holds. 


□ 
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5 Lovasz versus Basic LP 


Here we compare the Basic LP and the Lovasz convex relaxation, for problems that can be phrased 
both as a Min-CSP and as a Submodular Multiway Partition problem. That is, we consider a B-Min- 
CSP where each predicate 'Lg € B is of the form 

k 

Vein, /e({j : ij = *}) 

2—1 

and each f e : 2^ —> M_|_ is a submodular function. (Recall that Graph-MC, Hypergraph-MC and 
Node-Wt-MC fall in this category.) Then in both relaxations, we replace the labeling x v € [k] by 
variables y v ,i > 0 ,i € [k] such that X^=i Vv,i = 1- We also impose the list-coloring constraints y v> i = 0 
for i £ L v in both cases. The objective function is defined in different ways in the two relaxations. 

The Lovasz relaxation. We minimize S e eE i fe(Vv,i ■ v € e), where / is the Lovasz extension 
of /, /(y) = E 0e[o ,i][/(M y (0))] where A y {6) = {i : yi > 6}. 

The Basic LP. We minimize Yhe&E Sq ye,h ) ...,i k '^e(h, ■ ■ ■, k) subject to the consistency con¬ 
straints ye,h,...,ii = yv,ij where v is the j-th vertex of e. 

Lemma 5.1. The value of the Lovasz relaxation is at most the value of the Basic LP. 

Proof. Given a fractional solution of the Basic LP, with variables y v j for vertices v € V and for 

hyperedges e € E, each hyperedge contributes ye,h,...,i k '&e(h, ■ ■ ■, k), where ^ e (*i, ...,k) = 

Yli=i fe{{j '■ ij = *})■ In other words, each assignment (*i,..., ij) contributes Y2i=i f(Si) where Si is 
the set of coordinates labeled by i. Aggregating all the contributions of a given set S, we can define 
z e ,i,S as the sum of ye,^,...,^ over all choices where the coordinates labeled by i are exactly S. Then, 
the contribution of hyperedge e becomes 2i=i X)scm z e,i,sf(S). Moreover, the variables z e ^s are 
consistent with y v ^ for v € e in the sense that y Vi i = zZg-veS 

On the other hand, the contribution of a hyperedge e in the Lovasz relaxation is given by the 
Lovasz extension , f e (y v .i : v E e). It is known that the Lovasz extension is the convex closure of 
a submodular function, i.e. f e (y v ,i ■ v € e) is the minimum possible value of X^sc[z] z e,i,sf(S) subject 
to the consistency constraints y v .i = YIs-vgS z e,i,S> and z e ^s > 0. Therefore, the contribution of e to 
the Lovasz relaxation is always at most the contribution in the Basic LP. □ 

This means that the Basic LP is potentially a tighter relaxationand its fractional solution carries 
potentially more information than a fractional solution of the Lovasz relaxation. However, in some 
cases the two LPs coincide: We remark that for the Graph-MC problem, both relaxations are known 
to be equivalent to the CKR relaxation (see [5] for a discussion of the Lovasz relaxation, and |17| for 
a discussion of the “earthmover LP”, identical to our Basic LP). Our results also imply that for the 
Hypergraph-MC problem, both relaxations have the same integrality gap, 2 — 2/k. However, for 
certain problems the Basic LP can be strictly more powerful than the Lovasz relaxation. 

Hypergraph Multiway Partition. Given a hypergraph H = (V, E) and k terminals t \,..., tk € V, 
find a partition {A \,..., Ajf) of the vertices, so that i f(Ai) is minimized, where f(A) is the number 
of hyperedges cut by ( A , A). 

This is a special case of Sub-MP-Sym, because / here is the cut function in a hypergraph, a symmetric 
submodular function. The difference from Hypergraph Multiway Cut is that in Hypergraph Multiway 
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Partition, each cut hyperedge contributes 1 for each terminal that gets assigned some of its vertices 
(unlike in Hypergraph-MC where such a hyperedge contributes only 1 overall). 

Lemma 5.2. There is an instance of Hypergraph Multiway Partition where the Lovasz relaxation has 
a strictly lower optimum than the Basic LP. 

Proof. The instance is the following: We have a ground set V = {ti,£ 2 ,^ 3 , £ 4 ,£ 5 , 012 , ® 23 , « 34 , 045 , « 5 i}- 
The hyperedges are {ti,t 2 ,ai 2 }, {*2, * 3 , “23}, {*3, * 4 , <234}, {£4,^5,045}, {*5^1,051} with unit weight, 
and {012,023,034,045,051} with weight e = 0.001. 

The idea is that fractionally, each non-terminal a* j+i must be assigned half to ti and half to 
tj + 1, otherwise the cost of cutting the triple-edges is prohibitive. Then, the cost of the 5 -edge 
{012,023,034,045,051} is strictly higher in the Basic LP, since there is no good distribution consis¬ 
tent with the vertex variables (while the Lovasz relaxation is not sensitive to this). We verified by an 
LP solver that the two LPs have indeed distinct optimal values. □ 

6 The equivalence of integrality gap and symmetry gap 

In this section, we prove that for any Min-fi-CSP (specified by allowed predicates and candidate lists), 
the worst-case integrality gap of its Basic LP and the worst-case symmetry gap of its multilinear 
relaxation are the same. As we have seen, if we consider a fi-Min-CSP problem and 6 includes the 
Not-Equal predicate, then the integrality gap of the Basic LP implies a matching inapproximability 
result assuming UGC. Similarly, if the objective function can be viewed as Yli =1 /(<%) where / is 
submodular and S t is the set of vertices labeled i , then the symmetry gap implies an inapproximability 
result for this “submodular generalization” of the fi-CSP problem. In fact, the two hardness threshold 
often coincide as we have seen in the case of Hypergraph-MC and Sub-MP, where they are equal 
to 2 — 2 /k. Here, we show that it is not a coincidence that the two hardness thresholds are the same. 

We recall that the symmetry gap of a Min-CSP is the ratio s/c between the optimal fractional 
solution c and the optimal symmetric fractional solution s of the multilinear relaxation. The objective 
function in the multilinear relaxation is X(x) = E[/(x)], where x is an integer solution obtained by 
independently labeling each vertex v by i with probability s„ j. The notion of symmetry here is that 
I' on a ground set V' is invariant under a group Q of permutations of V'. A fractional solution is 
symmetric if for any permutation cr € Q and v' € V', v' and cr{v') have the same fractional assignment. 

The following theorem states that an integrality gap instance can be converted into a symmetry 
gap instance. 

Theorem 6.1. For any Q-CSP instance X(V, E, k, L v , h) whose Basic LP has optimum LP(T) = c, 
and the integer optimum is OPT(X) = s, there is a symmetric fi -CSP instance L'(V', E', k, L' v , h) 
whose symmetry gap is s/c. 

Proof. Given an optimal solution x of the Basic LP for instance X, without loss of generality, let us 
assume that all the variables in the solution have a rational value and there exists some M such that 
the values of all the variables, i.e, x Vjt and x eja , in the LP solution become integers if multiplied by 
M. For every vertex v € V, we define 

S v = {y £ [k] M ■ for every i € [&]; x v> i fraction of y 's coordinates have value i\. 

In other words, S v is the collection of strings in [k] M such that the portion of appearances of i in each 
string is exactly x v ^. 

We define a new instance on a ground set V' = {(v,y) : v € V, y € S v }. For every vertex 
(u, y) in V', its candidate list is L v , the candidate list of vertex v in instance X. Given an edge 
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e = (i?i, V 2 , ■ ■ ■, v\ e \) E E and its local distribution V e over [k]^ (implied by the fractional solution 
x et0l ), we call (y 1 , y 2 ,..., y^) E ([&] M )^ “consistent with e” if for every a € [k]^, the fraction of 
i E [M] such that {y \, y'f ,... ,y^) = a is exactly x e ^ a . It is easy to check (using the LP consistency 
constraints) that each y l must be in S Vi . We define S e to be the collection of all possible (y 1 , y 2 ,..., y^) 
that are consistent with edge e. 

For every edge e = (yi,V 2 , ■ ■ ■, v\ e \) and every (y 1 , y 2 ,..., yl e l) consistent with e, we add a constraint 
\k e over (uj^y 1 ),..., (uj,yl e l) with the same predicate function as edge e. We assign equal weights to 
all these copies of constraint d/ e , so that their total weight is equal to w e . 

Let Q be the group of all permutations n : [M] —» [M]. For any ir E Q, we also use the notation 
7 r(y) to indicate (y 7r (i),y 7r ( 2 )) • • • )2/tt(m)) for any U £ [k] M . Let us also think of Q as a permutation on 
V': for any (v,y) E V\ we map it to (v,tt( y)). Then it is easy to check that X' is invariant under 
any permutation a E Q. Also, for any (v, y), (v, y') E V ', since y and y' contain the same number 
of occurences for each i E [M\ (consistent with x V j), we know there exists some 7r E Q such that 
7 r(y) = y'. Therefore, a fractional solution of X' is symmetric with respect to Q if and only if (u, y) has 
the same fractional assignment as (u, ?/), for all y,y' E S v . 

Let us also assume that both X and X' are normalized with all their weights summing to 1; i.e, 
there is a distribution over the constraints with the probability being the weights. Essentially, X' 
is constructed by randomly picking a constraint *P e (ui, V 2 , ■ ■. ,v\ e \) in X and then randomly picking 
(y 1 ,y 2 ,..., yl e l) in S e . Then we add a constraint T e on (vi, y 1 ), (v 2 ,y 2 ), ■ ■ ■, (v\ e \,y^) in X'. 

Below, we prove that assuming there is a gap of s versus c between integer and fractional solutions 
for the Basic LP of X, there is a gap of at least s versus c between the best symmetric fractional 
solution and the best asymmetric fractional solution for the multilinear relaxation of X'. 


Some (asymmetric) solution of X' has cost at most c: Consider the following (random) assign¬ 
ment for X'\ we first choose a random i E [M] and then assign to each (v,y) the label y*. We know 
yi E L v , as it is a valid assignment for v by the definition of S v . Such an assignment has expected cost 


E 


u l e l 


e ' 

ae[fc]l e 


X, 


e,a 


T(a)] 


LP{X) = c. 


Therefore there is also some deterministic assignment of cost at most c. Our assignment is in fact 
integral, so the multilinear relaxation does not play a role here. 


Every symmetric fractional solution of X' has cost at least s: We know that a symmetric 
solution gives the same fractional assignment x v = (x V: i,x V: 2 , • • •, x v y) to vertex (v, y) for every y E S v . 
Also by definition of the multilinear relaxation, the value of this fractional solution is equal to the 
expected cost of independently choosing the label of each (y, y) from the distribution V v . which means 
i with probability x v ) j. Therefore, given a symmetric solution of X' specified by {x„ | v E V}, it has 
cost 


E 

e=(n,...,V|e| )eE,(y 1 ,.. 


■ jy^^GSeX^Vv. 


[^e(h,h, ■ ■ ■ ,l\e\)] 


(2) 


Since I 2 , ■ ■ ■, l\ e \ is independent of (y 1 , y 2 ,..., y l e l), we have 


E 

e (ill , 1)2 j. • • j^|e| 5^2 }••• j^|e| 


[^e(h,h, ■■■, l\e\)] 


where k is chosen independently from the distribution V Vi for the respective vertex Vi E e. This 
is exactly the cost on the original instance X if we independently label each v by sampling from 
distribution V v . Since the integer optimum of X is at least s, we have that fl2]) is also at least s. □ 
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In the other direction, we have the following. 

Theorem 6.2. For any symmetric 13 -CSP instance X(V,E,k,L v ,h) whose multilinear relaxation has 
symmetry gap 7 , and for any e > 0 there is a 13 -CSP instance X'(V', E ', k, L' v , h) whose Basic LP has 
integrality gap at least (1 — 6)7. 

Proof. Assume that X is an instance symmetric under a group of permutations G on the vertices V. 
For each v € V, we denote by uj(v ) the orbit of v, uj(v) = {o/u) : o € G}- We produce a new instance 
X' by folding the orbits, i.e. we identify all the elements in a given orbit. 

First, let us assume for simplicity that no constraint (edge) in X operates on more than one variable 
from each orbit. Then, we just fold each orbit co(v) into a single vertex. I.e., V' = {w(u) : v € V}. 
(We abuse notation here and use u(v) also to denote the vertex of V corresponding to the orbit u(v).) 
We also define E' to be edges corresponding one-to-one to the edges in E, with the same predicates; 
i.e. each constraint 1 F e (.Ti, • • • ,x\ e \) in X becomes 'h e (a;(xi),... ,<u(x| e |)) in X'. The candidate list for 
uj(v) is identical to the candidate list of v (and hence also to the candidate list for any other w € uj(v), 
by the symmetry condition on X). 

Now consider an optimal fractional solution of the multilinear relaxation of X, with variables x v j. 
We define a fractional solution of the Basic LP for X', by setting x,u(v),i = Y^weu(v) x w,i- We 
claim that the edge variables x e ^ Q can be assigned values consistent with x w ( v \ t in such a way that 
the objective value of the Basic LP X' is equal to the value of the multilinear relaxation for X. This 
is because the value of the multilinear relaxation is obtained by independently labeling each vertex v 
by i with probability x v i j. We can then define x e ^ a as the probability that edge e is labeled a under 
this random labeling. This shows that the optimum of the Basic LP for X' is at most the optimum 
of the multilinear relaxation of X (if we have a minimization problem; otherwise all inequalities are 
reversed). 

Now, consider an integer solution of the instance X 7 , i.e. a labeling of the orbits co(v) by labels in 
[k]. This induces a natural symmetric labeling of the instance X, where all the vertices in each orbit 
receive the same label. The values of the two labelings of X and X' are the same. (Both labelings 
are integer, so the multilinear relaxation does not play a role here.) This shows that the symmetric 
optimum of the multilinear relaxation of X is at most the integer optimum of X'. Together with the 
previous paragraph, we obtain that the gap between integer and fractional solutions of the Basic LP 
of X' is at least the gap between symmetric and asymmetric solutions of the multilinear relaxation of 
X. This completes the proof in the special case where no predicate takes two variables from the same 
orbit. 

Now, let us consider the general case, in which a constraint of X can contain multiple variables from 
the same orbit. Note that the proof above doesn’t work here, because it would produce predicates in X' 
that operate on the same variable multiple times, which we do not allow. (As an instructive example, 
consider a symmetric instance on two elements {1,2} with the constraint l± I2 , which serves as 
a starting point in proving the hardness of (1/2 + e)-approximating the maximum of a nonnegative 
submodular function. This instance is symmetric with respect to switching 1 and 2 , and hence the 
folded instance would contain only one element and the constraint l\ l\. which is not a meaningful 
integrality gap instance.) 

Instead, we replace each orbit by a large cluster of identical elements, and we produce copies of 
each constraint of X, using distinct elements from the respective clusters. As an example, consider 
the 2 -element instance above. We would first fold the elements { 1 , 2 } into one, and then replace it by 
a large cluster C of identical elements. The original constraint l\ I2 will be replaced by the same 
constraint for every pair of distinct elements {?-,/} C C. In other words, the new instance is a Max 
Cut instance on a complete graph and we consider the Basic LP for this instance. The symmetry 
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gap of the original instance is 2, and the integrality gap of the Basic LP is arbitrarily close to 2 (for 
\C\ —> oo). 

Now let us describe the general reduction. For each orbit u(v) of X, we produce a disjoint cluster 
of elements The candidate list for each vertex in C u r v ) is the same as the candidate list for 

any vertex in u>(v) (which must be the same by the symmetry assumption for X). The ground set 
of X' is V' = Ut,ey Cw(v)- Tor each edge e = (iq,... ,v\ e \) of X, we produce a number of copies by 
considering all possible edges e 7 = (v \,..., uK) where v[ € C u ^ and v[,, .., vC are distinct. We use 
the same predicate, ’P e / = T e . The edge weights w e < are defined so that they are equal and add up 
to w e . Let us assume that X, X 7 are minimization problems. Let c be value of the optimal solution of 
the multilinear relaxation of X, and let s be the the value of the optimal symmetric solution of the 
multilinear relaxation of X. 

The fractional optimum of X' is at most c: Let x V)l be an solution of the multilinear relaxation 
of value c. We define a fractional solution of the Basic LP for X' to be equal to av,i = Y2 w eu(v) Xl »,i 
for each v' € C w t v \ . Similar to the construction in the simpler case above, we define the edge variables 
x e i' Q to simulate the independent rounding that defines the value of the multilinear extension of X; 
specifically, we define x e ^ a for e 7 = (u>(vi),ui(v 2 ), • • •, cu(v| e |)) to be the probability that e = (v\,..., v\ e \) 
receives assignment a, where e is the edge that produced e! in our reduction. By construction, the 
value of this fractional solution for X 7 is equal to the value of the multilinear relaxation for X which is 
c. 

The integer optimum of X 7 is at least s: Here, we consider any integer solution of X 7 and 
we produce a symmetric fractional solution for X 7 . If the integer solution for X 7 is denoted by l(v'), 
we consider each cluster C w t v ) in X 7 and we define x v ^ to be the fraction of vertices in C^u,) that are 
labeled i. Note that by definition, x V)l = x W j for all w € oj{v) and hence x v . L depends only on the 
orbit of v. This means that x v ^ is a symmetric fractional solution. Also, it respects the candidate lists 
of X because X 7 has the same candidate lists and hence the fractional solution uses only the allowed 
labels for each vertex v. The value of this symmetric fractional solution is given by the multilinear 
extension, i.e. the expected value of a random labeling where each label is chosen independently with 
probabilities x Vtl . The final observation is that in the limit as \C,^^ \ —>• oo, this is very close to the 
value of the integer solution l(v') for X 7 : this is because the edges e 7 in X 7 are defined by all possible 
choices of distinct elements in the respective clusters. In the limit, this is arbitrarily close to sampling 
independently random vertices from the respective clusters, which is what the multilinear relaxation 
corresponds to. Therefore, the optimal symmetric fractional solution to X cannot be worse than the 
integer optimum of X 7 in the limit. 

In conclusion, the integrality gap of the Basic LP for X 7 can be made arbitrarily close to the 
symmetry gap of X, s/c. □ 
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A Missing proofs from Section [3] 

A.l Hardness for symmetric Sub-MP 

Here, we prove Theorem 13.51 We follow the symmetry gap approach of the proof of Theorem 13. 11 only 

the symmetric instance is different here. 


The symmetric instance. Let V = [k] x [k]. It is convenient to think of the elements of V as 
belonging to “rows” Ri = {{i,j) | 1 < j < k} and “columns” C* = {(j,i) \ 1 < j < k}. The elements 
[i,i) are the terminals, i.e, t* = We define our cost function / as follows. 

Let 7 E [0, k] be a parameter; we assume for convenience that 7 is an even integer. Let f : M —> R 
be the following function: 


(j)(t) = 


t 


For each i E [k], let gi : 2 


v 


2k — t — 7 
be the following function: 


if t < k — 7 / 2 , 
otherwise. 


9i(S) = 




if U <£. S, 
otherwise. 


The function / : 2 V — > M + is defined by f(S ) = X^=i 9i(S H Ri) + Y)i =1 9 i(SnCi). It is straightforward 
to verify that / is non-negative, submodular, and symmetric. As before, the instance is symmetric 
under swapping the vertices (i, j) and (j, i) for all i / j. 
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Computing the symmetry gap. 


Theorem A.l. Let 7 = 2[2k — y /3 k 2 — 2k\. Then the gap between the best symmetric and asymmetric 
assignment for the instance defined above is at least — O(^) & 1.2679 — 0(%). 

Definition A.2. We say that element ( i,j ) is unhappy if it is assigned to a terminal l (f {i, j}. An 
assignment is nice if it has the following properties: 

(a) Each unhappy element is assigned to terminal 1. 

(b) For each i 1, the number of elements in row Ri that are assigned to terminal i is either 0 or 
at least 7 / 2 . Similarly, for each i fi\, the number of elements in column Ci that are assigned to 
terminal i is either 0 or at least 7 / 2 . The number of elements in row R\ and column C\ that are 
assigned to terminal 1 is at least 7 / 2 . 

(c) For each i 1, if the number of unhappy elements in row Ri is at least k — 7 / 2 , all the elements 
in Ri \ {(i,i)} are assigned to terminal 1. Similarly, for each i 1, if the number of unhappy 
elements in column Ci is at least k — 7 / 2 , all the elements in Ci \ {(i, i)} are assigned to terminal 

1 . 

Theorem I A. II will follow from the following lemmas. 

Lemma A.3. Consider a symmetric assignment of cost C. Then there exists a nice symmetric 
assignment whose cost is at most C + 6k. 

Lemma A.4. Let 7 = 2 [2k — y/2>k 2 — 2k\. Then the cost of any nice symmetric assignment is at least 
(8^3-12 )k 2 -0{k). 

Proof of Theorem I A. 11 Note that it follows from Lemma IA.3I and Lemma IA.4I that the optimal sym¬ 
metric assignment has cost 

OPT > (8\/3 - 12) A : 2 - 0(k). 

Now we claim that the optimal asymmetric assignment has cost 

OPT < 2 k 2 - 7 k< 2k 2 - 4 k 2 + 2AV3A2 + 0(k) = (2^3 - 2 )k 2 + 0(k). 

Consider the assignment that assigns the i- th row R,j = {(i,j) \ 1 < j < k} to terminal i. We have 

k k 

f(R,) = E gi(RinRe) + ^2g t (Ri n Q) 

(.=1 1 =1 

= gi(Ri) + gi{{{i,i)}) +^2ge{{(iJ)}) 

e^i 

= (j>(0) + cj)(k - 1) + (k - 1 )^( 1 ) 

= 0 + (k + 1 — 7 ) + (k — 1 ) • 1 = 2k — 7 . 

Therefore the total cost of the assignment is k(2k — 7 ) and the theorem follows by plugging in the 
value of 7 . □ 
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A. 1.1 Proof of Lemma IA.31 


Consider a symmetric assignment. Let A^ denote the set of all elements that are assigned to terminal l. 
Note that, for each z, row Rj and column have the same number of unhappy elements; additionally, 
they have the same number of elements assigned to terminal i. Let n t = \Ai n Ri\ denote the number 
of elements in Ri that are assigned to terminal i and let zzj denote the number of elements in Ri that 
are unhappy. Let um denote the number of unhappy elements in Ri that are assigned to terminal l. 
Let I be the set of all indices i such that rii < 7 / 2 . 

We reassign elements as follows. For each i € I \ {1}, i.e. such that row Ri has fewer than 7/2 
elements assigned to terminal z, we reassign all the elements in (Ri U Ci) n (Ai \ {(z, z)}) to terminal 1. 
If row R\ has less than 7/2 elements assigned to terminal 1, we reassign all the elements in R\ U C\ to 
terminal 1. We reassign each remaining unhappy element to terminal 1. Finally, for each i such that 
row Ri has at least 7 — 7/2 elements assigned to terminal 1, we reassign all elements in f?.;UCi\{(z,z)} 
to terminal 1. Note that the resulting assignment is symmetric. 

Let (A\, ■ ■ ■ ,A ' k ) denote the resulting assignment, where A ^ is the set of all elements that are 
assigned to terminal £, and let C' be its cost. Let J denote the set of all indices z such that all of the 
elements of Ri \ {(z, z)} have been reassigned to terminal 1; more precisely, A’ x n Ri D Ri \ {(z,z)}. 

Claim A.5. For each i € I, gi(A' i r\Ri ) < —zzj + 1. For each i € I, gi(A’ i nRi) = gi(Ai<lRi). 

Proof. Consider an index i € /. Suppose first that i = 1. Note that we reassigned all of the elements 
in R\ to terminal 1. Thus g\ (A\ n R\) = <f(k — \A\ n R\\) = 0. Since n\ < 7/2 and 7 < k, 
gi(A\ FI R\) = <fi(k — \Ai n f?i|) = k — 7 + n\ > n\. Therefore we may assume that z 7 ^ 1. Note that 
we reassigned all of the elements in Ai FI Ri \ {(z, z)} to terminal 1 and thus \A( n Ri\ = 1. We have 
gi(Aj n Ri) = 4>(k - |Ai fl Ri\) = k - 7 + rz* and gi(A’ i n Ri) = f(k - \A( fl Ri\) = k — 7 + 1, which 
gi(Ai n R) < gi(Ai n R) - + 1. 

Consider an index i € /. Since all of the elements of Aj n Ri are happy, we have A( n Ri = Aj n Ri 
and therefore gi(A! { n Rf) = gi(Ai n Ri). □ 

Corollary A.6 . We have J2i=i 9i(K Fl Ri) < Ylt= 1 9 i(A Fl Ri) - J2iei n i + k - 

Claim A.7. For each i € J — {1}, g t (A\ n Ri) < gi(A t fl Rf) — rii +1. For each i € 1(1 (J — {1}), 
g i (A , 1 nR i ) < gi(Air\Ri) + Ui-Uii+rn. For each i e 7n(J-{l}), g i (A’ l ^R i ) < gi(AinRi) + Ui-un. 

Proof. Consider an index z € J — {1}. Since we reassigned all elements of Ri — {(z,z)} to terminal 1, 
gi(A\ n Ri) = 4>(k— 1) = 7 — 7 + 1. Suppose there exists anf /i such that | Ai* nf?j| > 7 — 7 / 2 . Then 
gi(A(* n Ri) = 4>(\Ai* Fl Ri\) = 27 — 7 — \ Ag* Fl Rf\. Additionally, for each l ^ {£*,z}, \A^ fl Rf\ < 7/2 
and thus gi(Ai fl Rf) = f(\Ae fl Ri\) = \At fl Ri\. Therefore 

J29i(A e nR i ) = 2k- 1 -\A e *nR i \+ \ A e^Ri\ 

l£{t* ,1} 

= 27 — 7 — 2|A^» nRi\ + k-rn 

> 37 — 7 — rii — 2(7 — rii) (since | A^* Fl Ri\ < 7 — rii) 

= 7 — 7 + zzj. 

Therefore gi(A[ nR i ) = k-'y + l< gi(A e n Ri) - n* + 1. 

Thus we may assume that \AgP\Ri\ < 7 — 7/2 for each l ^ i. For each £ 7 ^ z, we have gifAgCi Ri) = 
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<f>(\Ae n Ri\) = I A( n Ri\. Thus 

22di(A £ fl Ri) = k- rii, 

(Ai 

9i(A'i n Ri) = k - 7 + 1 

= 22 Si (A £ n Ri) + m - 7 + 1 
(Ai 

< X 9 i(Ae n R t ) - ri t + 1 (since 7 > 0). 

(Ai 

Consider an index i € I n (J \ { 1 }). Then n* < 7/2 and m + n* < k — 7/2. The elements of Ri 
that we reassigned to terminal 1 were the unhappy elements and the elements of Ai n Ri- Note that, 
since i 7^ 1 , at most one element of A\ fi Ri is happy (namely the element (i, 1) G C 1 ). Therefore 

\A' l n Ri\ < Ui + rii + 1 < k — 7/2 and \A\ n Ri\ < un +1 < k — 7/2. Thus gi(A[ n Ri) = <f>(\A! x n Ri\) = 

| A[ nRi\ = \Ai nRi\+Ui- un + rii and gi(Ai n Ri) = <l>(\Ai n Ri\) = \Ai n Ri\. 

Consider an index i G I D (J \ {1}). Then n,; > 7/2 and Ui < k — 7 / 2 . The elements of Ri that 
we reassigned to terminal 1 were the unhappy elements. Note that at most one element of A\ n Ri is 
happy and therefore | A\ n R t \ < u t + 1 < k — 7 / 2 . Additionally, \A\ DR,) < un + 1 < k — 7 / 2 . Thus 
giiA'^Ri) = <p(\A[nRi\) = \A[nRi\ = lA^Ril+m-un and g-^nRi) = ^A^Ri]) = |Aini2i|. □ 

Claim A. 8 . For each i G J and each £ ^ {i, 1}, gi(A' £ n Ri) = 0. For each i G J and each £ ^ {i, 1}, 
g i {A! i n Ri) < gi(A £ n Ri) - u i£ . 

Proof. Consider an index i G J. Then Ri\{(i, i)} C A \. It follows that, for each £ ^ {i, 1}, A' £ DRi = 0 
and gi(A' e fl Rf) = 4>(\A' e fl R z j) = 0. 

Consider an index i € J and an index £ {i. 1}. Since i G J, we have Ui < k — 7 / 2. Note 

that at most one element of A £ fl Ri is happy and thus | A £ n Ri\ < um + 1 < k — 7 / 2 . Moreover, 

| A' e fl Ri\ = I A £ fl Ri\ - u i£ < 1. Thus gi{A £ n Ri) = 4>(\A £ fl Ri\) = I A £ C Ri\ and g i {A' £ fl Ri) = 

<K\A' e n Ri\) = I A! t n Ri\ = I a £ n Rf - u i£ . □ 

Corollary A.9. We have 

X X gi ( A e nR i) ^ X X 9i( A e n R >) - X n * + k - 

ieJ—{1} (-Ai i£J—{ 1} (Ai i£J—{ 1} 

Proof. By the preceding claims, we have 

X Si(A\ nR. t ) < 22 X 9i( A * n Ri) ~ X n i + k 

i£j—{ 1} iSJ—{1} (Ai iSJ—{1} 

X X 9i(A £ n R i) = 0 

i£j-{ 1} t${i, 1} 

X '22si{A' £ nRi)= 22 9i(A 1 nR i )+ 22 X 

i£j-{l} (Ai i£J-{ 1} ieJ-{l}e<£{i,l} 

< X ^2 9i(A £ D Ri) - 22 n i + k. 

i£j—{ 1} (Ai iSJ—{1} 

□ 
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Corollary A.10. We have 


E E gi {A' e nRi) < E E^ n ^)+ E n - 

*eJ-{i} ieJ-{i} tyi ie/n(J-{i}) 


Proof. We have 


E n -R*) < E Si(^i n R*) + E “ “ii) + E n * 

ieJ~{i} ieJ-{ 1 } ieJ-{i} ie/n(J-{i}) 

E E 9iAt n Ri) < E E 9i(Ai n Ri) - E E Uii 

ieJ-{ I}^{i,i} *eJ-{i} ieJ-{i}^{*,!} 

= E E E 

E E^(^ n ^)= E n R t ) + E E 

isJ—{1} W* *eJ-{i} ieJ-{i} ^{*,1} 

= E E^ n ^)+ E n *- 

ieJ-{i} VAi ie/n(J-{i}) 

□ 

Claim A.11. We have 

YjnAn n Ri) < k. 

Proof. Consider an l 1. Note that we may assume that 1 ^ J, since otherwise Ajflfli =0 and 
< 7 i (A' ( n Ri ) = </>(|A^ fl i?i|) = 0. Since at most one element of An n R\ is happy and we reassigned all 
unhappy elements of A?i to terminal 1, we have \A' t n i?i| < 1 and thus (j\ (A' ( n R\) = <f{\ A' e fl A?i|) 

1 . 


By symmetry, the cost over columns is the same as the cost over rows. It follows that the cost C' of 
the resulting assignment satisfies 


k k 


c'-E/(4) = *EE 9i Ai C Ri 


e=i 


i =i e=i 


2 (E 9iAi hi Rf) + E 9iAe n Ri) + E E 9iAt n R») + E E 9iAt ^ R» 

. i=l ^/1 iGJ—{1} iGJ—{1} 

k k 

S *|EE fl A?*) - E n ‘- E n * + E n* + 3/c 

. *=1 <=i iei i£j-{ 1 } ie/n(J-{i}) 

fc k \ 


£ 2 EE 9i(At n -R?:) + 3A: 


\ 2=1 £=1 
= C + 6k. 
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A.1.2 Proof of Lemma I A. 41 


Consider a nice symmetric assignment. We define a coloring of the complete graph I\f as follows. 
Vertex i represent terminal (i,i) and the edge (i,j) represents the pair (i,j), where i ^ j. We use the 
symmetric assignment to define a coloring of the edges of K Consider a pair (i,j) where i ^ j , and 
suppose (i, j) is assigned to terminal £; we assign color £ to the edge (i, j) of K^. (Since the assignment 
is symmetric, (i,j) and (j,i) are both assigned to the same terminal.) Let En denote the set of all 
edges of K}. that have color £, and let TV,- denote the set of all edges incident to vertex i. We say that 
an edge {i. j) is unhappy if the element (i,j) is unhappy. Let g denote the number of unhappy edges. 

We partition [k] into L and L = [k\ — L, where L is the set of all vertices i such that \Ei n W| < 
7/2 — 1. Note that since the assignment is nice, we actually have |Ei D Ni\ = 0 in this case. Note 
that vertices i € L are “lonely”, i.e., none of the edges incident to i have color i. Moreover, 1 ^ L, 
since vertex 1 is incident to at least 7/2 — 1 edges of color 1. We further partition L into L\ and 
L 2 = L — Li, where L\ is the set of all vertices iGf such that all the edges incident to i have color 1. 

Let a = \Li\, b = I-L 2 I, and c = \L\ = k — a — b. Let C be the cost of the assignment. Recall that 
p is the number of unhappy edges. 

Claim A. 12. We have 

C > 2 k(k — 2 ) — 2(2a + b )7 + 4(a + b) + 4 g. 

Proof. Let An denote the set of all elements that are assigned to terminal £. Let Ri (resp. Cf) denote the 
i -th row (resp. the z-th column). Note that, since the assignment is symmetric, g^A^nRi) = gi(AnCiCi). 
Therefore 

y = ^E/(^) = EE^niJ i ) 

z e =1 1=1 i=x 

k 

= gi{Ai n Rf) + gi(A £ n Ri) 

i =1 

k k 

= J2 <Kk -1 - | Ei n Ni\) + J2J2 ^ n ^D- 

i =1 i=l £^i 

We consider the two sums separately. 
k 

X>( fc - 1 - I Ei nNi\) = J2 <t>(k -1 - \Ei n Nil) + - 1 - |£i n Wl). 

*=1 i&L i£L 

For each i € L, \Ei n Ni\ = 0 and thus <j>{k — 1 — |£) n W|) = <j>{k — 1). For each i € L, \Ei n Ni\ > 7/2 
and thus <f(k — 1 — |E) n Ni\) = k — 1 — |£) n Ni\. Thus 

k 

J2 <Kk -1 - \Ei n Nil) = ^ - 1)|£| + (k - 1)\L\ - \Ei n Nil 

i= 1 i&L 

= k(k - 1 ) - (7 - 2 )(o + b) - \Ei n Ni\. 

i&L 

Now consider the second sum. Consider an index i G Li and an index f / i. If £ = 1, we have 
|An n N\ = k — 1 and, if £ / 1, we have have |An n N\ = 0. Since 1 ^ L\, we have 

EE <KI E t n w|) = ^ <X|£i n N\) = <K* - i)M- 

i£Li 
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Therefore we have 


EE WEenNil) = EE 4>{\Ei n i\Tj|) + EE <f>(\E e nNi\) + EE cp(\E e nNi\ 

i= 1 i€Li i(zL2 ij-i i^L ^ 7 ^ 

= (j>{k — l)|Ti| + EE <K\E e nN i \) + EE l^nJVil 

1 GL 2 bfci i£L 

= (k + 1 — 7 )a + EE n Ni\) + EE 


iGX/2 


iGL ^ 


Consider an index i € +2 \ {1}. If there exists an i i such that | E( n N t | > k — 7 / 2 , at least fc — 7/2 
edges incident to i are unhappy and therefore all of the edges incident to i have color 1. But i has at 
least 7/2 edges incident to it that have color i and therefore \E( n A r ,;| < k — 7/2 for all i 7 ^ i. Thus, 
for each i € L 2 \ {1}, we have 

Y^^(\E £ nN i \) = Y,\EenN l \. 

i+i i+i 

Therefore 

k 

|^nJVi|)>(A: + l-7)o-fc+ J2\E t nNi\. 

i =1 ISI/2UL 

By combining the inequalities derived so far, it follows that 

C 

— + k(k — 1) + /c(a — 1) — (2a + 6)7 + 3a + 26 — ^ ^ |-ZTj n A^| + E E |^niVi|. 


ieL 


i£L 2 UL l+i 


Recall that, for each i £ Li, all of the edges incident to i have color one. Thus Ez eLi Z^z l^niV, 
E ieL i |-Ei n iVj| = (k — l)a, and therefore 



k 

k(k — 2) — (2 a + + 2(a + 6) — ^ ^ l-E 1 ^ Pi A^| + EE \e £ n Ni 

ieL i =1 


In fact, the sum of L can be replaced by a sum over all i, because vertices in L do not have any edge of 
color i incident to them. Each unhappy edge contributes 2 to the sum Ei=i E^+. \Ee H Ni\, and each 
happy edge contributes 1 to the sum E*=i Ej?/« | E^ n N t \. Moreover, each unhappy edge contributes 

0 to the sum E*=i \Ei n iY,;|, and each happy edge contributes 1 to the sum E*=i \E% fl N t \. Recall 
that the number of unhappy edges is p. Thus we have 


Therefore 


k 


EE \E e n Ni 

i= 1 i^i 


Y i \Ei^ N i 

ieL 


k 


EE \E e n Ni 

i= 1 l±i 


k 

^2 \Ei n N i 

i= 1 


2 /r. 


C 

~2 


> — 2 ) — ( 2 a + 6)7 + 2 (a + 6 ) + 2/i. 


□ 


Claim A.13. The number of unhappy edges /j satisfies 


T > 


(a + 6 )(a + 6 - 1 ) 
2 


+ a(k — a — 6 — 1 ). 
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Proof. For each vertex i € L, there are no edges incident to L that have color i. Therefore all edges 
with both endpoints in i are unhappy; there are (a + 6 )(a + 6 — l )/2 such edges. Additionally, the 
edges incident to a vertex in L\ have color 1. Therefore all edges (i,j) such that i G Li and j € L and 
j / 1 are also unhappy; there are at least a(c — 1 ) = a(k — a — 6 — 1 ) such edges. □ 


Proof of Lemma A.f. It follows from the previous claims that the total cost C satisfies 


C > 26(6 - 2 ) + 2a(2k - 2 7 - a - 2 ) + 26(6 - 7 ). 


Let 6 (a, 6 ) = 2 k(k — 2) + 2a[2k — 27 — a — 2) + 26(6 — 7 ). The function 6 (a, 6 ) is concave in a and 
therefore it is minimized when a = 0 or a = k. 

Consider the case a = 0. By taking the derivative of 6 (0, 6 ) with respect to 6 and setting it to 
zero, we see that 6,(0, 6 ) is minimized when 6 = 7 / 2 . Consider the case a = k. Since a + 6 < k, we 
have 6 = 0. Thus C > min{6(0, 7 / 2 ), 6 ( 6 , 0)}. We choose 7 = 2(26 — y/3k 2 — 26J which makes the 
two expressions equal (up to the rounding of 7). This implies 

C> (8\/3- 12 ) 6 2 -0(6). 


□ 


A.2 From oracle hardness to computational hardness 

Here we explain how the hardness results in the value oracle model (the first parts of Theorem 13.11 
and 13.51) imply the analogous computational hardness results in these theorems. The key technique 
here is the use of list-decodable codes for the encoding of a “hidden partition”, as introduced in [ 8 ]. 
The technique, although presented in [ 8 ] for submodular maximization problems, applies practically 
without any change to submodular minimization as well. For concreteness, let us summarize how this 
technique applies to the Sub-MP problem: 

We generate instances on a ground set X = N x V, where V = [6] x [k] is the ground set of our 
initial symmetric instance. The goal is to present an objective function / : 2 X —>• M_|_ in a form that 
makes it difficult to distinguish the elements in sets N x {(z,j)} from elements in N x for any 

fixed i. j. More precisely, we construct a partition of N x V into sets {A^ : (i. j) € [ 6 ] x [ 6 ]} such that 
it is hard to distinguish the elements in A 7 from the elements in A ]t . 

We accomplish this by way of a list-decodable code as follows (the construction is simplified for this 
particular case): We consider a Unique-SAT instance <j> with rn variables, and an encoding function 
E : {0, l} m — > {0, iJ.^ Vx ( 2 ) . The satisfying assignment x* of (j) (if it exists) implicitly defines a partition 
of A x [ k ] x [k] into subsets {A 7 : ( i,j) e [k] x [ 6 ]}, by specifying that for i < j, 

A^ = {( a,i,j) € N x [k] x [k] : = 0} U {(a,j,i) € N x [k] x [k] : (E(x*)) a ^ itj y = 1}. 

In other words, the string E(x*) tells us which pairs of elements ( a,i,j ), ( a,j,i ) should be swapped, 
to obtain the sets A tJ from the sets N x {(i,j)}. In particular, if E(x*) is the all-zeros string, then 
Aij = N x {(i, j)}. For i = j , we define An = N x {(i, z)} in every case. 

The encoding of an objective function for Sub-MP then consists purely of the Unique-SAT instance 
(f, which is interpreted as follows. If f is satisfiable, it induces the sets A t] as above, and the objective 
function is understood to be f(S) = F(£), where £7 = and F is the smooth submodular 

function we described in Section [3j If 0 is not satisfiable, then the objective function is understood to 
be f(S) = (?(£), with the same notation as in Section [3l We know that the gap between the optima 
in these two cases can be made arbitrarily close to 2 — 2 / 6 , and if we could distinguish the two cases, 
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we could determine whether a Unique-SAT instance has 0 or 1 satisfying assignments, which would 
imply NP = RP. 

The question, however, is whether this is a legitimate encoding of the objective function, in the 
sense that given the encoding 4> and a set S, the value of f(S) can be computed efficiently. The main 
insight of |8] is that list-decodable codes allow us to evaluate such functions efficiently. The reason for 
this is that the functions F(x),G(x) are constructed in such a way (going back to [21], see Lemma lT2l) 
that their value depends on the hidden partition A^j only if x is significantly unbalanced, meaning that 
Yha^Aij x a ~ YlaeAji x a > P n for some i,j € [k\ and constant (3 > 0. Equivalently, this means that to 
evaluate /(S') and g(S), we need to determine the hidden partition only if \S n A t j\ — |S n Aji\ > j3n 
for some i , j E [k]. Fortunately, there are known list-decodable codes that allow us to determine the 
satisfying assignment x* and the induced partition {Aij : i,j E [A:]}, given any such set S (because it 
corresponds to a corrupted message which is relatively close to the codeword E(x*)). 

To summarize, for a given set S, we are either able to determine x* and the induced partition 
{A^ : i,j E [A;]}, in which case we are obviously able to evaluate f(S) or g(S)\ or we conclude that 
S does not satisfy |S n Ajj \ — |S n Aj % \ > /3n for any i, j E [k], and in this case we do not need to 
know the partition [A^ : i, j E [ k ]} because f(S) = g(S ) and g does not depend on the partition of 
A^ U Aji into A^ and Aji. This concludes the proof that we presented a legitimate encoding of the 
objective function. 

For further details on this technique to convert oracle hardness results into computational hardness, 
we refer the reader to [8]. 
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